Title |
The Human Phenotype Ontology: Semantic Unification of Common and Rare Disease
|
---|---|
Published in |
American Journal of Human Genetics, June 2015
|
DOI | 10.1016/j.ajhg.2015.05.020 |
Pubmed ID | |
Authors |
Tudor Groza, Sebastian Köhler, Dawid Moldenhauer, Nicole Vasilevsky, Gareth Baynam, Tomasz Zemojtel, Lynn Marie Schriml, Warren Alden Kibbe, Paul N. Schofield, Tim Beck, Drashtti Vasant, Anthony J. Brookes, Andreas Zankl, Nicole L. Washington, Christopher J. Mungall, Suzanna E. Lewis, Melissa A. Haendel, Helen Parkinson, Peter N. Robinson |
Abstract |
The Human Phenotype Ontology (HPO) is widely used in the rare disease community for differential diagnostics, phenotype-driven analysis of next-generation sequence-variation data, and translational research, but a comparable resource has not been available for common disease. Here, we have developed a concept-recognition procedure that analyzes the frequencies of HPO disease annotations as identified in over five million PubMed abstracts by employing an iterative procedure to optimize precision and recall of the identified terms. We derived disease models for 3,145 common human diseases comprising a total of 132,006 HPO annotations. The HPO now comprises over 250,000 phenotypic annotations for over 10,000 rare and common diseases and can be used for examining the phenotypic overlap among common diseases that share risk alleles, as well as between Mendelian diseases and common diseases linked by genomic location. The annotations, as well as the HPO itself, are freely available. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 25 | 30% |
Australia | 5 | 6% |
France | 5 | 6% |
Spain | 4 | 5% |
United Kingdom | 3 | 4% |
Germany | 3 | 4% |
India | 1 | 1% |
Israel | 1 | 1% |
Croatia | 1 | 1% |
Other | 6 | 7% |
Unknown | 29 | 35% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 46 | 55% |
Scientists | 30 | 36% |
Practitioners (doctors, other healthcare professionals) | 6 | 7% |
Science communicators (journalists, bloggers, editors) | 1 | 1% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 9 | 3% |
Spain | 3 | <1% |
Germany | 2 | <1% |
United Kingdom | 2 | <1% |
Netherlands | 1 | <1% |
South Africa | 1 | <1% |
Iceland | 1 | <1% |
Unknown | 300 | 94% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 86 | 27% |
Student > Ph. D. Student | 67 | 21% |
Student > Master | 27 | 8% |
Other | 22 | 7% |
Student > Postgraduate | 15 | 5% |
Other | 55 | 17% |
Unknown | 47 | 15% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 76 | 24% |
Biochemistry, Genetics and Molecular Biology | 55 | 17% |
Medicine and Dentistry | 49 | 15% |
Computer Science | 41 | 13% |
Business, Management and Accounting | 6 | 2% |
Other | 38 | 12% |
Unknown | 54 | 17% |