Title |
De novo assembly of a haplotype-resolved human genome
|
---|---|
Published in |
Nature Biotechnology, May 2015
|
DOI | 10.1038/nbt.3200 |
Pubmed ID | |
Authors |
Hongzhi Cao, Honglong Wu, Ruibang Luo, Shujia Huang, Yuhui Sun, Xin Tong, Yinlong Xie, Binghang Liu, Hailong Yang, Hancheng Zheng, Jian Li, Bo Li, Yu Wang, Fang Yang, Peng Sun, Siyang Liu, Peng Gao, Haodong Huang, Jing Sun, Dan Chen, Guangzhu He, Weihua Huang, Zheng Huang, Yue Li, Laurent C A M Tellier, Xiao Liu, Qiang Feng, Xun Xu, Xiuqing Zhang, Lars Bolund, Anders Krogh, Karsten Kristiansen, Radoje Drmanac, Snezana Drmanac, Rasmus Nielsen, Songgang Li, Jian Wang, Huanming Yang, Yingrui Li, Gane Ka-Shu Wong, Jun Wang |
Abstract |
The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-genome shotgun strategies, based solely on next-generation sequencing and hierarchical assembly methods. We applied our sequencing method to the genome of an Asian individual and generated a 5.15-Gb assembled genome with a haplotype N50 of 484 kb. Our analysis identified previously undetected indels and 7.49 Mb of novel coding sequences that could not be aligned to the human reference genome, which include at least six predicted genes. This haplotype-resolved genome represents the most complete de novo human genome assembly to date. Application of our approach to identify individual haplotype differences should aid in translating genotypes to phenotypes for the development of personalized medicine. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 39 | 29% |
United Kingdom | 12 | 9% |
India | 4 | 3% |
France | 4 | 3% |
Germany | 4 | 3% |
Netherlands | 3 | 2% |
Spain | 3 | 2% |
Finland | 2 | 1% |
Thailand | 1 | <1% |
Other | 14 | 10% |
Unknown | 50 | 37% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 67 | 49% |
Members of the public | 63 | 46% |
Science communicators (journalists, bloggers, editors) | 3 | 2% |
Practitioners (doctors, other healthcare professionals) | 3 | 2% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 5 | 2% |
France | 2 | <1% |
Netherlands | 2 | <1% |
Spain | 2 | <1% |
United Kingdom | 2 | <1% |
Portugal | 1 | <1% |
Canada | 1 | <1% |
New Zealand | 1 | <1% |
Switzerland | 1 | <1% |
Other | 4 | 2% |
Unknown | 217 | 91% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 57 | 24% |
Student > Ph. D. Student | 54 | 23% |
Other | 22 | 9% |
Student > Master | 22 | 9% |
Student > Bachelor | 13 | 5% |
Other | 37 | 16% |
Unknown | 33 | 14% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 93 | 39% |
Biochemistry, Genetics and Molecular Biology | 66 | 28% |
Computer Science | 15 | 6% |
Medicine and Dentistry | 5 | 2% |
Neuroscience | 4 | 2% |
Other | 14 | 6% |
Unknown | 41 | 17% |