↓ Skip to main content

The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes

Overview of attention for article published in Giga Science, October 2016
Altmetric Badge

About this Attention Score

  • In the top 5% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (94th percentile)

Mentioned by

1 news outlet
1 blog
40 tweeters
1 peer review site
2 Facebook pages
4 Wikipedia pages
1 Google+ user


13 Dimensions

Readers on

37 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes
Published in
Giga Science, October 2016
DOI 10.1186/s13742-016-0148-z
Pubmed ID

Qing Mao, Serban Ciotlos, Rebecca Yu Zhang, Madeleine P. Ball, Robert Chin, Paolo Carnevali, Nina Barua, Staci Nguyen, Misha R. Agarwal, Tom Clegg, Abram Connelly, Ward Vandewege, Alexander Wait Zaranek, Preston W. Estep, George M. Church, Radoje Drmanac, Brock A. Peters


Since the completion of the Human Genome Project in 2003, it is estimated that more than 200,000 individual whole human genomes have been sequenced. A stunning accomplishment in such a short period of time. However, most of these were sequenced without experimental haplotype data and are therefore missing an important aspect of genome biology. In addition, much of the genomic data is not available to the public and lacks phenotypic information. As part of the Personal Genome Project, blood samples from 184 participants were collected and processed using Complete Genomics' Long Fragment Read technology. Here, we present the experimental whole genome haplotyping and sequencing of these samples to an average read coverage depth of 100X. This is approximately three-fold higher than the read coverage applied to most whole human genome assemblies and ensures the highest quality results. Currently, 114 genomes from this dataset are freely available in the GigaDB repository and are associated with rich phenotypic data; the remaining 70 should be added in the near future as they are approved through the PGP data release process. For reproducibility analyses, 20 genomes were sequenced at least twice using independent LFR barcoded libraries. Seven genomes were also sequenced using Complete Genomics' standard non-barcoded library process. In addition, we report 2.6 million high-quality, rare variants not previously identified in the Single Nucleotide Polymorphisms database or the 1000 Genomes Project Phase 3 data. These genomes represent a unique source of haplotype and phenotype data for the scientific community and should help to expand our understanding of human genome evolution and function.

Twitter Demographics

The data shown below were collected from the profiles of 40 tweeters who shared this research output. Click here to find out more about how the information was compiled.

Mendeley readers

The data shown below were compiled from readership statistics for 37 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 3 8%
Hungary 1 3%
United Kingdom 1 3%
Unknown 32 86%

Demographic breakdown

Readers by professional status Count As %
Researcher 13 35%
Student > Ph. D. Student 7 19%
Other 4 11%
Student > Postgraduate 3 8%
Student > Bachelor 2 5%
Other 5 14%
Unknown 3 8%
Readers by discipline Count As %
Biochemistry, Genetics and Molecular Biology 16 43%
Agricultural and Biological Sciences 11 30%
Computer Science 2 5%
Mathematics 1 3%
Environmental Science 1 3%
Other 1 3%
Unknown 5 14%

Attention Score in Context

This research output has an Altmetric Attention Score of 41. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 17 June 2019.
All research outputs
of 15,799,426 outputs
Outputs from Giga Science
of 763 outputs
Outputs of similar age
of 271,478 outputs
Outputs of similar age from Giga Science
of 4 outputs
Altmetric has tracked 15,799,426 research outputs across all sources so far. Compared to these this one has done particularly well and is in the 96th percentile: it's in the top 5% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 763 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 23.0. This one has done well, scoring higher than 86% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 271,478 tracked outputs that were published within six weeks on either side of this one in any source. This one has done particularly well, scoring higher than 94% of its contemporaries.
We're also able to compare this research output to 4 others from the same source and published within six weeks on either side of this one. This one has scored higher than all of them