↓ Skip to main content

Toward an Efficient Method of Identifying Core Genes for Evolutionary and Functional Microbial Phylogenies

Overview of attention for article published in PLOS ONE, September 2011
Altmetric Badge

Mentioned by

twitter
1 X user

Citations

dimensions_citation
90 Dimensions

Readers on

mendeley
248 Mendeley
citeulike
6 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Toward an Efficient Method of Identifying Core Genes for Evolutionary and Functional Microbial Phylogenies
Published in
PLOS ONE, September 2011
DOI 10.1371/journal.pone.0024704
Pubmed ID
Authors

Nicola Segata, Curtis Huttenhower

Abstract

Microbial community metagenomes and individual microbial genomes are becoming increasingly accessible by means of high-throughput sequencing. Assessing organismal membership within a community is typically performed using one or a few taxonomic marker genes such as the 16S rDNA, and these same genes are also employed to reconstruct molecular phylogenies. There is thus a growing need to bioinformatically catalog strongly conserved core genes that can serve as effective taxonomic markers, to assess the agreement among phylogenies generated from different core gene, and to characterize the biological functions enriched within core genes and thus conserved throughout large microbial clades. We present a method to recursively identify core genes (i.e. genes ubiquitous within a microbial clade) in high-throughput from a large number of complete input genomes. We analyzed over 1,100 genomes to produce core gene sets spanning 2,861 bacterial and archaeal clades, ranging in size from one to >2,000 genes in inverse correlation with the α-diversity (total phylogenetic branch length) spanned by each clade. These cores are enriched as expected for housekeeping functions including translation, transcription, and replication, in addition to significant representations of regulatory, chaperone, and conserved uncharacterized proteins. In agreement with previous manually curated core gene sets, phylogenies constructed from one or more of these core genes agree with those built using 16S rDNA sequence similarity, suggesting that systematic core gene selection can be used to optimize both comparative genomics and determination of microbial community structure. Finally, we examine functional phylogenies constructed by clustering genomes by the presence or absence of orthologous gene families and show that they provide an informative complement to standard sequence-based molecular phylogenies.

X Demographics

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 248 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 13 5%
Sweden 2 <1%
Germany 2 <1%
Spain 2 <1%
Australia 1 <1%
Belgium 1 <1%
India 1 <1%
United Kingdom 1 <1%
Denmark 1 <1%
Other 0 0%
Unknown 224 90%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 67 27%
Researcher 52 21%
Student > Master 23 9%
Student > Doctoral Student 20 8%
Student > Postgraduate 13 5%
Other 47 19%
Unknown 26 10%
Readers by discipline Count As %
Agricultural and Biological Sciences 123 50%
Biochemistry, Genetics and Molecular Biology 35 14%
Computer Science 12 5%
Immunology and Microbiology 12 5%
Environmental Science 6 2%
Other 18 7%
Unknown 42 17%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 26 September 2011.
All research outputs
#18,295,723
of 22,651,245 outputs
Outputs from PLOS ONE
#153,642
of 193,366 outputs
Outputs of similar age
#104,565
of 126,005 outputs
Outputs of similar age from PLOS ONE
#2,017
of 2,503 outputs
Altmetric has tracked 22,651,245 research outputs across all sources so far. This one is in the 11th percentile – i.e., 11% of other outputs scored the same or lower than it.
So far Altmetric has tracked 193,366 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 15.0. This one is in the 10th percentile – i.e., 10% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 126,005 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 8th percentile – i.e., 8% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 2,503 others from the same source and published within six weeks on either side of this one. This one is in the 7th percentile – i.e., 7% of its contemporaries scored the same or lower than it.