Title |
Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications
|
---|---|
Published in |
Frontiers in Genetics, August 2016
|
DOI | 10.3389/fgene.2016.00136 |
Pubmed ID | |
Authors |
Lahiru Iddamalgoda, Partha S. Das, Achala Aponso, Vijayaraghava S. Sundararajan, Prashanth Suravajhala, Jayaraman K. Valadi |
Abstract |
Data mining and pattern recognition methods reveal interesting findings in genetic studies, especially on how the genetic makeup is associated with inherited diseases. Although researchers have proposed various data mining models for biomedical approaches, there remains a challenge in accurately prioritizing the single nucleotide polymorphisms (SNP) associated with the disease. In this commentary, we review the state-of-art data mining and pattern recognition models for identifying inherited diseases and deliberate the need of binary classification- and scoring-based prioritization methods in determining causal variants. While we discuss the pros and cons associated with these methods known, we argue that the gene prioritization methods and the protein interaction (PPI) methods in conjunction with the K nearest neighbors' could be used in accurately categorizing the genetic factors in disease causation. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
Sri Lanka | 2 | 20% |
Israel | 2 | 20% |
United States | 2 | 20% |
India | 2 | 20% |
Switzerland | 1 | 10% |
Unknown | 1 | 10% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 7 | 70% |
Scientists | 2 | 20% |
Unknown | 1 | 10% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 2 | 6% |
Unknown | 34 | 94% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 12 | 33% |
Student > Ph. D. Student | 3 | 8% |
Student > Master | 3 | 8% |
Student > Bachelor | 2 | 6% |
Professor | 1 | 3% |
Other | 3 | 8% |
Unknown | 12 | 33% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 6 | 17% |
Biochemistry, Genetics and Molecular Biology | 5 | 14% |
Computer Science | 5 | 14% |
Neuroscience | 2 | 6% |
Nursing and Health Professions | 1 | 3% |
Other | 2 | 6% |
Unknown | 15 | 42% |