In:
Nature Communications, Springer Science and Business Media LLC, Vol. 13, No. 1 ( 2022-05-26)
Kurzfassung:
We initiate the Westlake BioBank for Chinese (WBBC) pilot project with 4,535 whole-genome sequencing (WGS) individuals and 5,841 high-density genotyping individuals, and identify 81.5 million SNPs and INDELs, of which 38.5% are absent in dbSNP Build 151. We provide a population-specific reference panel and an online imputation server ( https://wbbc.westlake.edu.cn/ ) which could yield substantial improvement of imputation performance in Chinese population, especially for low-frequency and rare variants. By analyzing the singleton density of the WGS data, we find selection signatures in SNX29 , DNAH1 and WDR1 genes, and the derived alleles of the alcohol metabolism genes ( ADH1A and ADH1B ) emerge around 7,000 years ago and tend to be more common from 4,000 years ago in East Asia. Genetic evidence supports the corresponding geographical boundaries of the Qinling-Huaihe Line and Nanling Mountains, which separate the Han Chinese into subgroups, and we reveal that North Han was more homogeneous than South Han.
Materialart:
Online-Ressource
ISSN:
2041-1723
DOI:
10.1038/s41467-022-30526-x
Sprache:
Englisch
Verlag:
Springer Science and Business Media LLC
Publikationsdatum:
2022
ZDB Id:
2553671-0