↓ Skip to main content

Augmenting Microarray Data with Literature-Based Knowledge to Enhance Gene Regulatory Network Inference

Overview of attention for article published in PLoS Computational Biology, June 2014
Altmetric Badge

About this Attention Score

  • Above-average Attention Score compared to outputs of the same age (52nd percentile)
  • Average Attention Score compared to outputs of the same age and source

Mentioned by

twitter
6 X users
googleplus
1 Google+ user

Citations

dimensions_citation
31 Dimensions

Readers on

mendeley
71 Mendeley
citeulike
1 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Augmenting Microarray Data with Literature-Based Knowledge to Enhance Gene Regulatory Network Inference
Published in
PLoS Computational Biology, June 2014
DOI 10.1371/journal.pcbi.1003666
Pubmed ID
Authors

Guocai Chen, Michael J. Cairelli, Halil Kilicoglu, Dongwook Shin, Thomas C. Rindflesch

Abstract

Gene regulatory networks are a crucial aspect of systems biology in describing molecular mechanisms of the cell. Various computational models rely on random gene selection to infer such networks from microarray data. While incorporation of prior knowledge into data analysis has been deemed important, in practice, it has generally been limited to referencing genes in probe sets and using curated knowledge bases. We investigate the impact of augmenting microarray data with semantic relations automatically extracted from the literature, with the view that relations encoding gene/protein interactions eliminate the need for random selection of components in non-exhaustive approaches, producing a more accurate model of cellular behavior. A genetic algorithm is then used to optimize the strength of interactions using microarray data and an artificial neural network fitness function. The result is a directed and weighted network providing the individual contribution of each gene to its target. For testing, we used invasive ductile carcinoma of the breast to query the literature and a microarray set containing gene expression changes in these cells over several time points. Our model demonstrates significantly better fitness than the state-of-the-art model, which relies on an initial random selection of genes. Comparison to the component pathways of the KEGG Pathways in Cancer map reveals that the resulting networks contain both known and novel relationships. The p53 pathway results were manually validated in the literature. 60% of non-KEGG relationships were supported (74% for highly weighted interactions). The method was then applied to yeast data and our model again outperformed the comparison model. Our results demonstrate the advantage of combining gene interactions extracted from the literature in the form of semantic relations with microarray analysis in generating contribution-weighted gene regulatory networks. This methodology can make a significant contribution to understanding the complex interactions involved in cellular behavior and molecular physiology.

X Demographics

X Demographics

The data shown below were collected from the profiles of 6 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 71 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Spain 1 1%
United States 1 1%
Netherlands 1 1%
Slovenia 1 1%
Unknown 67 94%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 17 24%
Researcher 15 21%
Student > Master 11 15%
Student > Bachelor 6 8%
Other 4 6%
Other 9 13%
Unknown 9 13%
Readers by discipline Count As %
Agricultural and Biological Sciences 19 27%
Computer Science 14 20%
Biochemistry, Genetics and Molecular Biology 13 18%
Medicine and Dentistry 9 13%
Chemical Engineering 1 1%
Other 3 4%
Unknown 12 17%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 3. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 20 June 2014.
All research outputs
#8,535,684
of 25,374,917 outputs
Outputs from PLoS Computational Biology
#5,638
of 8,960 outputs
Outputs of similar age
#80,107
of 243,585 outputs
Outputs of similar age from PLoS Computational Biology
#73
of 142 outputs
Altmetric has tracked 25,374,917 research outputs across all sources so far. This one is in the 43rd percentile – i.e., 43% of other outputs scored the same or lower than it.
So far Altmetric has tracked 8,960 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 20.4. This one is in the 33rd percentile – i.e., 33% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 243,585 tracked outputs that were published within six weeks on either side of this one in any source. This one has gotten more attention than average, scoring higher than 52% of its contemporaries.
We're also able to compare this research output to 142 others from the same source and published within six weeks on either side of this one. This one is in the 47th percentile – i.e., 47% of its contemporaries scored the same or lower than it.