↓ Skip to main content

Single Nucleotide Polymorphism relevance learning with Random Forests for Type 2 diabetes risk prediction

Overview of attention for article published in Artificial Intelligence in Medicine, September 2017
Altmetric Badge

About this Attention Score

  • In the top 25% of all research outputs scored by Altmetric
  • Good Attention Score compared to outputs of the same age (78th percentile)
  • High Attention Score compared to outputs of the same age and source (81st percentile)

Mentioned by

twitter
10 X users

Citations

dimensions_citation
64 Dimensions

Readers on

mendeley
184 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Single Nucleotide Polymorphism relevance learning with Random Forests for Type 2 diabetes risk prediction
Published in
Artificial Intelligence in Medicine, September 2017
DOI 10.1016/j.artmed.2017.09.005
Pubmed ID
Authors

Beatriz López, Ferran Torrent-Fontbona, Ramón Viñas, José Manuel Fernández-Real

Abstract

The use of artificial intelligence techniques to find out which Single Nucleotide Polymorphisms (SNPs) promote the development of a disease is one of the features of medical research, as such techniques may potentially aid early diagnosis and help in the prescription of preventive measures. In particular, the aim is to help physicians to identify the relevant SNPs related to Type 2 diabetes, and to build a decision-support tool for risk prediction. We use the Random Forest (RF) technique in order to search for the most important attributes (SNPs) related to diabetes, giving a weight (degree of importance), ranging between 0 and 1, to each attribute. Support Vector Machines and Logistic Regression have also been used since they are two other machine learning techniques that are well-established in the health community. Their performance has been compared to that achieved by RF. Furthermore, the relevance of the attributes obtained through the use of RF has then been used to perform predictions with k-Nearest Neighbour method weighting attributes in the similarity measure according to the relevance of the attributes with RF. Testing is performed on a set of 677 subjects. RF is able to handle the complexity of features' interactions, overfitting, and unknown attribute values, providing the SNPs' relevance with an up to 0.89 area under the ROC curve in terms of risk prediction. RF outperforms all the other tested machine learning techniques in terms of prediction accuracy, and in terms of the stability of the estimated relevance of the attributes. The Random Forest is a useful method for learning predictive models and the relevance of SNPs without any underlying assumption.

X Demographics

X Demographics

The data shown below were collected from the profiles of 10 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 184 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 184 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 30 16%
Student > Bachelor 25 14%
Researcher 22 12%
Student > Master 22 12%
Student > Doctoral Student 10 5%
Other 24 13%
Unknown 51 28%
Readers by discipline Count As %
Computer Science 32 17%
Biochemistry, Genetics and Molecular Biology 24 13%
Medicine and Dentistry 13 7%
Engineering 12 7%
Agricultural and Biological Sciences 6 3%
Other 35 19%
Unknown 62 34%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 9. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 11 July 2018.
All research outputs
#4,209,172
of 25,382,440 outputs
Outputs from Artificial Intelligence in Medicine
#109
of 913 outputs
Outputs of similar age
#68,986
of 326,430 outputs
Outputs of similar age from Artificial Intelligence in Medicine
#2
of 11 outputs
Altmetric has tracked 25,382,440 research outputs across all sources so far. Compared to these this one has done well and is in the 83rd percentile: it's in the top 25% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 913 research outputs from this source. They receive a mean Attention Score of 4.8. This one has done well, scoring higher than 88% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 326,430 tracked outputs that were published within six weeks on either side of this one in any source. This one has done well, scoring higher than 78% of its contemporaries.
We're also able to compare this research output to 11 others from the same source and published within six weeks on either side of this one. This one has done well, scoring higher than 81% of its contemporaries.