Altmetric – HBLAST: Parallelised sequence similarity – A Hadoop MapReducable basic local alignment search tool

You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.

Title	HBLAST: Parallelised sequence similarity – A Hadoop MapReducable basic local alignment search tool
Published in	Journal of Biomedical Informatics, January 2015
DOI	10.1016/j.jbi.2015.01.008
Pubmed ID	25625550
Authors	Aisling O’Driscoll, Vladislav Belogrudov, John Carroll, Kai Kropp, Paul Walsh, Peter Ghazal, Roy D. Sleator
Abstract	The recent exponential growth of genomic databases has resulted in the common task of sequence alignment becoming one of the major bottlenecks in the field of computational biology. It is typical for these large datasets and complex computations to require cost prohibitive High Performance Computing (HPC) to function. As such, parallelised solutions have been proposed but many exhibit scalability limitations and are incapable of effectively processing "Big Data" - the name attributed to datasets that are extremely large, complex and require rapid processing. The Hadoop framework, comprised of distributed storage and a parallelised programming framework known as MapReduce, is specifically designed to work with such datasets but it is not trivial to efficiently redesign and implement bioinformatics algorithms according to this paradigm. The parallelisation strategy of "divide and conquer" for alignment algorithms can be applied to both data sets and input query sequences. However, scalability is still an issue due to memory constraints or large databases, with very large database segmentation leading to additional performance decline. Herein, we present Hadoop Blast (HBlast), a parallelised BLAST algorithm that proposes a flexible method to partition both databases and input query sequences using "virtual partitioning". HBlast presents improved scalability over existing solutions and well balanced computational work load while keeping database segmentation and recompilation to a minimum. Enhanced BLAST search performance on cheap memory constrained hardware has significant implications for in field clinical diagnostic testing; enabling faster and more accurate identification of pathogenic DNA in human blood or tissue samples.

View on publisher site Alert me about new mentions

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.

Geographical breakdown

Country	Count	As %
China	1	50%
Unknown	1	50%

Demographic breakdown

Type	Count	As %
Members of the public	2	100%

Mendeley readers

The data shown below were compiled from readership statistics for 122 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
Colombia	1	<1%
Malaysia	1	<1%
Australia	1	<1%
Brazil	1	<1%
India	1	<1%
Korea, Republic of	1	<1%
Spain	1	<1%
United States	1	<1%
Unknown	114	93%

Demographic breakdown

Readers by professional status	Count	As %
Researcher	24	20%
Student > Ph. D. Student	20	16%
Student > Master	17	14%
Student > Bachelor	15	12%
Student > Doctoral Student	7	6%
Other	22	18%
Unknown	17	14%

Readers by discipline	Count	As %
Computer Science	42	34%
Agricultural and Biological Sciences	18	15%
Biochemistry, Genetics and Molecular Biology	12	10%
Engineering	10	8%
Business, Management and Accounting	3	2%
Other	20	16%
Unknown	17	14%

Attention Score in Context

This research output has an Altmetric Attention Score of 5. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 16 August 2022.

All research outputs

#6,754,462

of 25,374,647 outputs

Outputs from Journal of Biomedical Informatics

#521

of 2,247 outputs

Outputs of similar age

#83,973

of 359,728 outputs

Outputs of similar age from Journal of Biomedical Informatics

#11

of 35 outputs

Altmetric has tracked 25,374,647 research outputs across all sources so far. This one has received more attention than most of these and is in the 73rd percentile.

So far Altmetric has tracked 2,247 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.3. This one has done well, scoring higher than 76% of its peers.

Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 359,728 tracked outputs that were published within six weeks on either side of this one in any source. This one has done well, scoring higher than 76% of its contemporaries.

We're also able to compare this research output to 35 others from the same source and published within six weeks on either side of this one. This one has gotten more attention than average, scoring higher than 68% of its contemporaries.

HBLAST: Parallelised sequence similarity – A Hadoop MapReducable basic local alignment search tool

About this Attention Score

Mentioned by

Citations

Readers on

X Demographics

Geographical breakdown

Demographic breakdown

Mendeley readers

Geographical breakdown

Demographic breakdown

Attention Score in Context