↓ Skip to main content

MG-Digger: An Automated Pipeline to Search for Giant Virus-Related Sequences in Metagenomes

Overview of attention for article published in Frontiers in Microbiology, March 2016
Altmetric Badge

About this Attention Score

  • In the top 25% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (80th percentile)
  • Good Attention Score compared to outputs of the same age and source (78th percentile)

Mentioned by

blogs
1 blog
twitter
3 X users

Citations

dimensions_citation
26 Dimensions

Readers on

mendeley
77 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
MG-Digger: An Automated Pipeline to Search for Giant Virus-Related Sequences in Metagenomes
Published in
Frontiers in Microbiology, March 2016
DOI 10.3389/fmicb.2016.00428
Pubmed ID
Authors

Jonathan Verneau, Anthony Levasseur, Didier Raoult, Bernard La Scola, Philippe Colson

Abstract

The number of metagenomic studies conducted each year is growing dramatically. Storage and analysis of such big data is difficult and time-consuming. Interestingly, analysis shows that environmental and human metagenomes include a significant amount of non-annotated sequences, representing a 'dark matter.' We established a bioinformatics pipeline that automatically detects metagenome reads matching query sequences from a given set and applied this tool to the detection of sequences matching large and giant DNA viral members of the proposed order Megavirales or virophages. A total of 1,045 environmental and human metagenomes (≈ 1 Terabase) were collected, processed, and stored on our bioinformatics server. In addition, nucleotide and protein sequences from 93 Megavirales representatives, including 19 giant viruses of amoeba, and 5 virophages, were collected. The pipeline was generated by scripts written in Python language and entitled MG-Digger. Metagenomes previously found to contain megavirus-like sequences were tested as controls. MG-Digger was able to annotate 100s of metagenome sequences as best matching those of giant viruses. These sequences were most often found to be similar to phycodnavirus or mimivirus sequences, but included reads related to recently available pandoraviruses, Pithovirus sibericum, and faustoviruses. Compared to other tools, MG-Digger combined stand-alone use on Linux or Windows operating systems through a user-friendly interface, implementation of ready-to-use customized metagenome databases and query sequence databases, adjustable parameters for BLAST searches, and creation of output files containing selected reads with best match identification. Compared to Metavir 2, a reference tool in viral metagenome analysis, MG-Digger detected 8% more true positive Megavirales-related reads in a control metagenome. The present work shows that massive, automated and recurrent analyses of metagenomes are effective in improving knowledge about the presence and prevalence of giant viruses in the environment and the human body.

X Demographics

X Demographics

The data shown below were collected from the profiles of 3 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 77 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Brazil 2 3%
Portugal 1 1%
Canada 1 1%
South Africa 1 1%
Unknown 72 94%

Demographic breakdown

Readers by professional status Count As %
Researcher 14 18%
Student > Ph. D. Student 13 17%
Student > Master 13 17%
Student > Doctoral Student 8 10%
Student > Bachelor 6 8%
Other 11 14%
Unknown 12 16%
Readers by discipline Count As %
Agricultural and Biological Sciences 27 35%
Biochemistry, Genetics and Molecular Biology 16 21%
Immunology and Microbiology 4 5%
Environmental Science 4 5%
Computer Science 2 3%
Other 9 12%
Unknown 15 19%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 9. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 28 March 2016.
All research outputs
#3,614,049
of 22,856,968 outputs
Outputs from Frontiers in Microbiology
#3,300
of 24,866 outputs
Outputs of similar age
#58,154
of 301,000 outputs
Outputs of similar age from Frontiers in Microbiology
#114
of 545 outputs
Altmetric has tracked 22,856,968 research outputs across all sources so far. Compared to these this one has done well and is in the 84th percentile: it's in the top 25% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 24,866 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 6.3. This one has done well, scoring higher than 86% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 301,000 tracked outputs that were published within six weeks on either side of this one in any source. This one has done well, scoring higher than 80% of its contemporaries.
We're also able to compare this research output to 545 others from the same source and published within six weeks on either side of this one. This one has done well, scoring higher than 78% of its contemporaries.