Report for: Using machine learning algorithms to identify genes essential for cell survival

You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.

Title	Using machine learning algorithms to identify genes essential for cell survival
Published in	BMC Bioinformatics, October 2017
DOI	10.1186/s12859-017-1799-1
Pubmed ID	28984184
Authors	Santosh Philips, Heng-Yi Wu, Lang Li
Abstract	With the explosion of data comes a proportional opportunity to identify novel knowledge with the potential for application in targeted therapies. In spite of this huge amounts of data, the solutions to treating complex disease is elusive. One reason being that these diseases are driven by a network of genes that need to be targeted in order to understand and treat them effectively. Part of the solution lies in mining and integrating information from various disciplines. Here we propose a machine learning method to mining through publicly available literature on RNA interference with the goal of identifying genes essential for cell survival. A total of 32,164 RNA interference abstracts were identified from 10.5 million pubmed abstracts (2001 - 2015). These abstracts spanned over 1467 cancer cell lines and 4373 genes representing a total of 25,891 cell gene associations. Among the 1467 cell lines 88% of them had at least 1 or up to 25 genes studied in a given cell line. Among the 4373 genes 96% of them were studied in at least 1 or up to 25 different cell lines. Identifying genes that are crucial for cell survival can be a critical piece of information especially in treating complex diseases, such as cancer. The efficacy of a therapeutic intervention is multifactorial in nature and in many cases the source of therapeutic disruption could be from an unsuspected source. Machine learning algorithms helps to narrow down the search and provides information about essential genes in different cancer types. It also provides the building blocks to generate a network of interconnected genes and processes. The information thus gained can be used to generate hypothesis which can be experimentally validated to improve our understanding of what triggers and maintains the growth of cancerous cells.

View on publisher site Alert me about new mentions

X Demographics

The data shown below were collected from the profiles of 3 X users who shared this research output. Click here to find out more about how the information was compiled.

Geographical breakdown

Country	Count	As %
Spain	1	33%
Unknown	2	67%

Demographic breakdown

Type	Count	As %
Scientists	2	67%
Members of the public	1	33%

Mendeley readers

The data shown below were compiled from readership statistics for 25 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
Unknown	25	100%

Demographic breakdown

Readers by professional status	Count	As %
Student > Bachelor	5	20%
Student > Ph. D. Student	5	20%
Researcher	4	16%
Student > Doctoral Student	2	8%
Student > Master	2	8%
Other	4	16%
Unknown	3	12%

Readers by discipline	Count	As %
Medicine and Dentistry	5	20%
Computer Science	4	16%
Agricultural and Biological Sciences	4	16%
Business, Management and Accounting	2	8%
Biochemistry, Genetics and Molecular Biology	2	8%
Other	4	16%
Unknown	4	16%

Attention Score in Context

This research output has an Altmetric Attention Score of 2. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 08 October 2017.

All research outputs

#14,956,881

of 23,005,189 outputs

Outputs from BMC Bioinformatics

#5,067

of 7,312 outputs

Outputs of similar age

#190,797

of 323,064 outputs

Outputs of similar age from BMC Bioinformatics

#57

of 105 outputs

Altmetric has tracked 23,005,189 research outputs across all sources so far. This one is in the 32nd percentile – i.e., 32% of other outputs scored the same or lower than it.

So far Altmetric has tracked 7,312 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one is in the 26th percentile – i.e., 26% of its peers scored the same or lower than it.

Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 323,064 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 37th percentile – i.e., 37% of its contemporaries scored the same or lower than it.

We're also able to compare this research output to 105 others from the same source and published within six weeks on either side of this one. This one is in the 38th percentile – i.e., 38% of its contemporaries scored the same or lower than it.

Using machine learning algorithms to identify genes essential for cell survival

About this Attention Score

Mentioned by

Citations

Readers on

X Demographics

Geographical breakdown

Demographic breakdown

Mendeley readers

Geographical breakdown

Demographic breakdown

Attention Score in Context