↓ Skip to main content

Common and phylogenetically widespread coding for peptides by bacterial small RNAs

Overview of attention for article published in BMC Genomics, July 2017
Altmetric Badge

About this Attention Score

  • Good Attention Score compared to outputs of the same age (67th percentile)
  • Good Attention Score compared to outputs of the same age and source (70th percentile)

Mentioned by

twitter
12 X users

Citations

dimensions_citation
28 Dimensions

Readers on

mendeley
70 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Common and phylogenetically widespread coding for peptides by bacterial small RNAs
Published in
BMC Genomics, July 2017
DOI 10.1186/s12864-017-3932-y
Pubmed ID
Authors

Robin C. Friedman, Stefan Kalkhof, Olivia Doppelt-Azeroual, Stephan A. Mueller, Martina Chovancová, Martin von Bergen, Benno Schwikowski

Abstract

While eukaryotic noncoding RNAs have recently received intense scrutiny, it is becoming clear that bacterial transcription is at least as pervasive. Bacterial small RNAs and antisense RNAs (sRNAs) are often assumed to be noncoding, due to their lack of long open reading frames (ORFs). However, there are numerous examples of sRNAs encoding for small proteins, whether or not they also have a regulatory role at the RNA level. Here, we apply flexible machine learning techniques based on sequence features and comparative genomics to quantify the prevalence of sRNA ORFs under natural selection to maintain protein-coding function in 14 phylogenetically diverse bacteria. Importantly, we quantify uncertainty in our predictions, and follow up on them using mass spectrometry proteomics and comparison to datasets including ribosome profiling. A majority of annotated sRNAs have at least one ORF between 10 and 50 amino acids long, and we conservatively predict that 409±191.7 unannotated sRNA ORFs are under selection to maintain coding (mean estimate and 95% confidence interval), an average of 29 per species considered here. This implies that overall at least 10.3±0.5% of sRNAs have a coding ORF, and in some species around 20% do. 165±69 of these novel coding ORFs have some antisense overlap to annotated ORFs. As experimental validation, many of our predictions are translated in published ribosome profiling data and are identified via mass spectrometry shotgun proteomics. B. subtilis sRNAs with coding ORFs are enriched for high expression in biofilms and confluent growth, and S. pneumoniae sRNAs with coding ORFs are involved in virulence. sRNA coding ORFs are enriched for transmembrane domains and many are predicted novel components of type I toxin/antitoxin systems. We predict over two dozen new protein-coding genes per bacterial species, but crucially also quantified the uncertainty in this estimate. Our predictions for sRNA coding ORFs, along with predicted novel type I toxins and tools for sorting and visualizing genomic context, are freely available in a user-friendly format at http://disco-bac.web.pasteur.fr. We expect these easily-accessible predictions to be a valuable tool for the study not only of bacterial sRNAs and type I toxin-antitoxin systems, but also of bacterial genetics and genomics.

X Demographics

X Demographics

The data shown below were collected from the profiles of 12 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 70 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 70 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 14 20%
Student > Bachelor 13 19%
Researcher 9 13%
Other 5 7%
Student > Master 4 6%
Other 12 17%
Unknown 13 19%
Readers by discipline Count As %
Biochemistry, Genetics and Molecular Biology 26 37%
Agricultural and Biological Sciences 14 20%
Computer Science 3 4%
Immunology and Microbiology 2 3%
Engineering 2 3%
Other 7 10%
Unknown 16 23%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 5. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 15 February 2018.
All research outputs
#6,642,268
of 23,881,329 outputs
Outputs from BMC Genomics
#2,805
of 10,793 outputs
Outputs of similar age
#102,193
of 316,256 outputs
Outputs of similar age from BMC Genomics
#62
of 215 outputs
Altmetric has tracked 23,881,329 research outputs across all sources so far. This one has received more attention than most of these and is in the 71st percentile.
So far Altmetric has tracked 10,793 research outputs from this source. They receive a mean Attention Score of 4.8. This one has gotten more attention than average, scoring higher than 73% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 316,256 tracked outputs that were published within six weeks on either side of this one in any source. This one has gotten more attention than average, scoring higher than 67% of its contemporaries.
We're also able to compare this research output to 215 others from the same source and published within six weeks on either side of this one. This one has gotten more attention than average, scoring higher than 70% of its contemporaries.