↓ Skip to main content

Using machine learning algorithms to identify genes essential for cell survival

Overview of attention for article published in BMC Bioinformatics, October 2017
Altmetric Badge

About this Attention Score

  • Average Attention Score compared to outputs of the same age
  • Average Attention Score compared to outputs of the same age and source

Mentioned by

twitter
3 X users

Citations

dimensions_citation
6 Dimensions

Readers on

mendeley
25 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Using machine learning algorithms to identify genes essential for cell survival
Published in
BMC Bioinformatics, October 2017
DOI 10.1186/s12859-017-1799-1
Pubmed ID
Authors

Santosh Philips, Heng-Yi Wu, Lang Li

Abstract

With the explosion of data comes a proportional opportunity to identify novel knowledge with the potential for application in targeted therapies. In spite of this huge amounts of data, the solutions to treating complex disease is elusive. One reason being that these diseases are driven by a network of genes that need to be targeted in order to understand and treat them effectively. Part of the solution lies in mining and integrating information from various disciplines. Here we propose a machine learning method to mining through publicly available literature on RNA interference with the goal of identifying genes essential for cell survival. A total of 32,164 RNA interference abstracts were identified from 10.5 million pubmed abstracts (2001 - 2015). These abstracts spanned over 1467 cancer cell lines and 4373 genes representing a total of 25,891 cell gene associations. Among the 1467 cell lines 88% of them had at least 1 or up to 25 genes studied in a given cell line. Among the 4373 genes 96% of them were studied in at least 1 or up to 25 different cell lines. Identifying genes that are crucial for cell survival can be a critical piece of information especially in treating complex diseases, such as cancer. The efficacy of a therapeutic intervention is multifactorial in nature and in many cases the source of therapeutic disruption could be from an unsuspected source. Machine learning algorithms helps to narrow down the search and provides information about essential genes in different cancer types. It also provides the building blocks to generate a network of interconnected genes and processes. The information thus gained can be used to generate hypothesis which can be experimentally validated to improve our understanding of what triggers and maintains the growth of cancerous cells.

X Demographics

X Demographics

The data shown below were collected from the profiles of 3 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 25 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 25 100%

Demographic breakdown

Readers by professional status Count As %
Student > Bachelor 5 20%
Student > Ph. D. Student 5 20%
Researcher 4 16%
Student > Doctoral Student 2 8%
Student > Master 2 8%
Other 4 16%
Unknown 3 12%
Readers by discipline Count As %
Medicine and Dentistry 5 20%
Computer Science 4 16%
Agricultural and Biological Sciences 4 16%
Business, Management and Accounting 2 8%
Biochemistry, Genetics and Molecular Biology 2 8%
Other 4 16%
Unknown 4 16%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 2. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 08 October 2017.
All research outputs
#14,956,881
of 23,005,189 outputs
Outputs from BMC Bioinformatics
#5,067
of 7,312 outputs
Outputs of similar age
#190,797
of 323,064 outputs
Outputs of similar age from BMC Bioinformatics
#57
of 105 outputs
Altmetric has tracked 23,005,189 research outputs across all sources so far. This one is in the 32nd percentile – i.e., 32% of other outputs scored the same or lower than it.
So far Altmetric has tracked 7,312 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one is in the 26th percentile – i.e., 26% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 323,064 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 37th percentile – i.e., 37% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 105 others from the same source and published within six weeks on either side of this one. This one is in the 38th percentile – i.e., 38% of its contemporaries scored the same or lower than it.