↓ Skip to main content

Determining Effects of Non-synonymous SNPs on Protein-Protein Interactions using Supervised and Semi-supervised Learning

Overview of attention for article published in PLoS Computational Biology, May 2014
Altmetric Badge

About this Attention Score

  • In the top 25% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (92nd percentile)
  • High Attention Score compared to outputs of the same age and source (83rd percentile)

Mentioned by

twitter
26 X users
patent
2 patents
googleplus
1 Google+ user

Citations

dimensions_citation
73 Dimensions

Readers on

mendeley
156 Mendeley
citeulike
6 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Determining Effects of Non-synonymous SNPs on Protein-Protein Interactions using Supervised and Semi-supervised Learning
Published in
PLoS Computational Biology, May 2014
DOI 10.1371/journal.pcbi.1003592
Pubmed ID
Authors

Nan Zhao, Jing Ginger Han, Chi-Ren Shyu, Dmitry Korkin

Abstract

Single nucleotide polymorphisms (SNPs) are among the most common types of genetic variation in complex genetic disorders. A growing number of studies link the functional role of SNPs with the networks and pathways mediated by the disease-associated genes. For example, many non-synonymous missense SNPs (nsSNPs) have been found near or inside the protein-protein interaction (PPI) interfaces. Determining whether such nsSNP will disrupt or preserve a PPI is a challenging task to address, both experimentally and computationally. Here, we present this task as three related classification problems, and develop a new computational method, called the SNP-IN tool (non-synonymous SNP INteraction effect predictor). Our method predicts the effects of nsSNPs on PPIs, given the interaction's structure. It leverages supervised and semi-supervised feature-based classifiers, including our new Random Forest self-learning protocol. The classifiers are trained based on a dataset of comprehensive mutagenesis studies for 151 PPI complexes, with experimentally determined binding affinities of the mutant and wild-type interactions. Three classification problems were considered: (1) a 2-class problem (strengthening/weakening PPI mutations), (2) another 2-class problem (mutations that disrupt/preserve a PPI), and (3) a 3-class classification (detrimental/neutral/beneficial mutation effects). In total, 11 different supervised and semi-supervised classifiers were trained and assessed resulting in a promising performance, with the weighted f-measure ranging from 0.87 for Problem 1 to 0.70 for the most challenging Problem 3. By integrating prediction results of the 2-class classifiers into the 3-class classifier, we further improved its performance for Problem 3. To demonstrate the utility of SNP-IN tool, it was applied to study the nsSNP-induced rewiring of two disease-centered networks. The accurate and balanced performance of SNP-IN tool makes it readily available to study the rewiring of large-scale protein-protein interaction networks, and can be useful for functional annotation of disease-associated SNPs. SNIP-IN tool is freely accessible as a web-server at http://korkinlab.org/snpintool/.

X Demographics

X Demographics

The data shown below were collected from the profiles of 26 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 156 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 6 4%
United Kingdom 2 1%
India 1 <1%
Spain 1 <1%
Netherlands 1 <1%
Unknown 145 93%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 33 21%
Student > Master 31 20%
Researcher 21 13%
Student > Bachelor 14 9%
Professor 9 6%
Other 21 13%
Unknown 27 17%
Readers by discipline Count As %
Agricultural and Biological Sciences 45 29%
Biochemistry, Genetics and Molecular Biology 32 21%
Computer Science 22 14%
Engineering 9 6%
Medicine and Dentistry 6 4%
Other 12 8%
Unknown 30 19%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 21. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 03 January 2023.
All research outputs
#1,783,758
of 25,394,764 outputs
Outputs from PLoS Computational Biology
#1,538
of 8,964 outputs
Outputs of similar age
#17,558
of 242,247 outputs
Outputs of similar age from PLoS Computational Biology
#26
of 153 outputs
Altmetric has tracked 25,394,764 research outputs across all sources so far. Compared to these this one has done particularly well and is in the 92nd percentile: it's in the top 10% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 8,964 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 20.4. This one has done well, scoring higher than 82% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 242,247 tracked outputs that were published within six weeks on either side of this one in any source. This one has done particularly well, scoring higher than 92% of its contemporaries.
We're also able to compare this research output to 153 others from the same source and published within six weeks on either side of this one. This one has done well, scoring higher than 83% of its contemporaries.