↓ Skip to main content

The impact of read length on quantification of differentially expressed genes and splice junction detection

Overview of attention for article published in Genome Biology, June 2015
Altmetric Badge

About this Attention Score

  • In the top 5% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (95th percentile)
  • High Attention Score compared to outputs of the same age and source (85th percentile)

Mentioned by

twitter
69 X users
facebook
1 Facebook page
wikipedia
3 Wikipedia pages
video
1 YouTube creator

Citations

dimensions_citation
103 Dimensions

Readers on

mendeley
384 Mendeley
citeulike
2 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
The impact of read length on quantification of differentially expressed genes and splice junction detection
Published in
Genome Biology, June 2015
DOI 10.1186/s13059-015-0697-y
Pubmed ID
Authors

Sagar Chhangawala, Gabe Rudy, Christopher E. Mason, Jeffrey A. Rosenfeld

Abstract

The initial next-generation sequencing technologies produced reads of 25 or 36 bp, and only from a single-end of the library sequence. Currently, it is possible to reliably produce 300 bp paired-end sequences for RNA expression analysis. While read lengths have consistently increased, people have assumed that longer reads are more informative and that paired-end reads produce better results than single-end reads. We used paired-end 101 bp reads and trimmed them to simulate different read lengths, and also separated the pairs to produce single-end reads. For each read-length and paired status, we evaluated differential expression levels between two standard samples and compared the results to those obtained by qPCR. We found that, with the exception of 25 bp reads, there is little difference for the detection of differential expression regardless of the read length. Once single-end reads are at a length of 50 bp, the results do not change substantially for any level up to, and including, 100 bp paired-end. However, splice junction detection significantly improves as the read length increases with 100 bp paired-end showing the best performance. We performed the same analysis on two ENCODE samples and found consistent results confirming that our conclusions have broad application. A researcher could save substantial resources by using 50 bp single-end reads for differential expression analysis instead of using longer reads. However, splicing detection is unquestionably improved by paired-end and longer reads. Therefore, an appropriate read length should be used based on the final goal of the study.

X Demographics

X Demographics

The data shown below were collected from the profiles of 69 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 384 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 5 1%
Germany 4 1%
United Kingdom 3 <1%
Norway 1 <1%
South Africa 1 <1%
Canada 1 <1%
Switzerland 1 <1%
Argentina 1 <1%
Luxembourg 1 <1%
Other 0 0%
Unknown 366 95%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 91 24%
Researcher 86 22%
Student > Master 53 14%
Student > Bachelor 42 11%
Student > Doctoral Student 15 4%
Other 42 11%
Unknown 55 14%
Readers by discipline Count As %
Agricultural and Biological Sciences 132 34%
Biochemistry, Genetics and Molecular Biology 112 29%
Computer Science 20 5%
Medicine and Dentistry 13 3%
Engineering 9 2%
Other 37 10%
Unknown 61 16%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 41. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 07 April 2023.
All research outputs
#1,036,048
of 25,845,895 outputs
Outputs from Genome Biology
#726
of 4,524 outputs
Outputs of similar age
#12,088
of 279,134 outputs
Outputs of similar age from Genome Biology
#10
of 67 outputs
Altmetric has tracked 25,845,895 research outputs across all sources so far. Compared to these this one has done particularly well and is in the 95th percentile: it's in the top 5% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 4,524 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 27.3. This one has done well, scoring higher than 83% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 279,134 tracked outputs that were published within six weeks on either side of this one in any source. This one has done particularly well, scoring higher than 95% of its contemporaries.
We're also able to compare this research output to 67 others from the same source and published within six weeks on either side of this one. This one has done well, scoring higher than 85% of its contemporaries.