Title |
‘Big data’, Hadoop and cloud computing in genomics
|
---|---|
Published in |
Journal of Biomedical Informatics, July 2013
|
DOI | 10.1016/j.jbi.2013.07.001 |
Pubmed ID | |
Authors |
Aisling O’Driscoll, Jurate Daugelaite, Roy D. Sleator |
Abstract |
Since the completion of the Human Genome project at the turn of the Century, there has been an unprecedented proliferation of genomic sequence data. A consequence of this is that the medical discoveries of the future will largely depend on our ability to process and analyse large genomic data sets, which continue to expand as the cost of sequencing decreases. Herein, we provide an overview of cloud computing and big data technologies, and discuss how such expertise can be used to deal with biology's big data sets. In particular, big data technologies such as the Apache Hadoop project, which provides distributed and parallelised data processing and analysis of petabyte (PB) scale data sets will be discussed, together with an overview of the current usage of Hadoop within the bioinformatics community. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 9 | 31% |
United Kingdom | 2 | 7% |
India | 2 | 7% |
Spain | 1 | 3% |
Netherlands | 1 | 3% |
Greece | 1 | 3% |
France | 1 | 3% |
Unknown | 12 | 41% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 19 | 66% |
Scientists | 9 | 31% |
Practitioners (doctors, other healthcare professionals) | 1 | 3% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 11 | 1% |
Brazil | 10 | 1% |
United Kingdom | 6 | <1% |
France | 4 | <1% |
Canada | 4 | <1% |
India | 4 | <1% |
Germany | 2 | <1% |
South Africa | 2 | <1% |
Japan | 2 | <1% |
Other | 18 | 2% |
Unknown | 861 | 93% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Master | 195 | 21% |
Student > Ph. D. Student | 175 | 19% |
Researcher | 118 | 13% |
Student > Bachelor | 107 | 12% |
Student > Doctoral Student | 50 | 5% |
Other | 161 | 17% |
Unknown | 118 | 13% |
Readers by discipline | Count | As % |
---|---|---|
Computer Science | 412 | 45% |
Agricultural and Biological Sciences | 100 | 11% |
Engineering | 75 | 8% |
Biochemistry, Genetics and Molecular Biology | 52 | 6% |
Business, Management and Accounting | 42 | 5% |
Other | 102 | 11% |
Unknown | 141 | 15% |