↓ Skip to main content

Robust Inference of Genetic Exchange Communities from Microbial Genomes Using TF-IDF

Overview of attention for article published in Frontiers in Microbiology, January 2017
Altmetric Badge

About this Attention Score

  • Good Attention Score compared to outputs of the same age (73rd percentile)
  • Above-average Attention Score compared to outputs of the same age and source (58th percentile)

Mentioned by

blogs
1 blog

Citations

dimensions_citation
12 Dimensions

Readers on

mendeley
27 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Robust Inference of Genetic Exchange Communities from Microbial Genomes Using TF-IDF
Published in
Frontiers in Microbiology, January 2017
DOI 10.3389/fmicb.2017.00021
Pubmed ID
Authors

Yingnan Cong, Yao-ban Chan, Charles A. Phillips, Michael A. Langston, Mark A. Ragan

Abstract

Bacteria and archaea can exchange genetic material across lineages through processes of lateral genetic transfer (LGT). Collectively, these exchange relationships can be modeled as a network and analyzed using concepts from graph theory. In particular, densely connected regions within an LGT network have been defined as genetic exchange communities (GECs). However, it has been problematic to construct networks in which edges solely represent LGT. Here we apply term frequency-inverse document frequency (TF-IDF), an alignment-free method originating from document analysis, to infer regions of lateral origin in bacterial genomes. We examine four empirical datasets of different size (number of genomes) and phyletic breadth, varying a key parameter (word length k) within bounds established in previous work. We map the inferred lateral regions to genes in recipient genomes, and construct networks in which the nodes are groups of genomes, and the edges natively represent LGT. We then extract maximum and maximal cliques (i.e., GECs) from these graphs, and identify nodes that belong to GECs across a wide range of k. Most surviving lateral transfer has happened within these GECs. Using Gene Ontology enrichment tests we demonstrate that biological processes associated with metabolism, regulation and transport are often over-represented among the genes affected by LGT within these communities. These enrichments are largely robust to change of k.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 27 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 27 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 10 37%
Professor 3 11%
Student > Bachelor 3 11%
Other 2 7%
Researcher 2 7%
Other 4 15%
Unknown 3 11%
Readers by discipline Count As %
Agricultural and Biological Sciences 7 26%
Biochemistry, Genetics and Molecular Biology 5 19%
Engineering 3 11%
Computer Science 3 11%
Chemical Engineering 2 7%
Other 4 15%
Unknown 3 11%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 6. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 07 February 2017.
All research outputs
#5,781,945
of 22,940,083 outputs
Outputs from Frontiers in Microbiology
#5,499
of 24,975 outputs
Outputs of similar age
#107,393
of 417,500 outputs
Outputs of similar age from Frontiers in Microbiology
#156
of 390 outputs
Altmetric has tracked 22,940,083 research outputs across all sources so far. This one has received more attention than most of these and is in the 74th percentile.
So far Altmetric has tracked 24,975 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 6.3. This one has done well, scoring higher than 77% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 417,500 tracked outputs that were published within six weeks on either side of this one in any source. This one has gotten more attention than average, scoring higher than 73% of its contemporaries.
We're also able to compare this research output to 390 others from the same source and published within six weeks on either side of this one. This one has gotten more attention than average, scoring higher than 58% of its contemporaries.