↓ Skip to main content

MICA: A fast short-read aligner that takes full advantage of Many Integrated Core Architecture (MIC)

Overview of attention for article published in BMC Bioinformatics, April 2015
Altmetric Badge

Citations

dimensions_citation
11 Dimensions

Readers on

mendeley
30 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
MICA: A fast short-read aligner that takes full advantage of Many Integrated Core Architecture (MIC)
Published in
BMC Bioinformatics, April 2015
DOI 10.1186/1471-2105-16-s7-s10
Pubmed ID
Authors

Ruibang Luo, Jeanno Cheung, Edward Wu, Heng Wang, Sze-Hang Chan, Wai-Chun Law, Guangzhu He, Chang Yu, Chi-Man Liu, Dazong Zhou, Yingrui Li, Ruiqiang Li, Jun Wang, Xiaoqian Zhu, Shaoliang Peng, Tak-Wah Lam

Abstract

Short-read aligners have recently gained a lot of speed by exploiting the massive parallelism of GPU. An uprising alterative to GPU is Intel MIC; supercomputers like Tianhe-2, currently top of TOP500, is built with 48,000 MIC boards to offer ~55 PFLOPS. The CPU-like architecture of MIC allows CPU-based software to be parallelized easily; however, the performance is often inferior to GPU counterparts as an MIC card contains only ~60 cores (while a GPU card typically has over a thousand cores). To better utilize MIC-enabled computers for NGS data analysis, we developed a new short-read aligner MICA that is optimized in view of MIC's limitation and the extra parallelism inside each MIC core. By utilizing the 512-bit vector units in the MIC and implementing a new seeding strategy, experiments on aligning 150 bp paired-end reads show that MICA using one MIC card is 4.9 times faster than BWA-MEM (using 6 cores of a top-end CPU), and slightly faster than SOAP3-dp (using a GPU). Furthermore, MICA's simplicity allows very efficient scale-up when multiple MIC cards are used in a node (3 cards give a 14.1-fold speedup over BWA-MEM). MICA can be readily used by MIC-enabled supercomputers for production purpose. We have tested MICA on Tianhe-2 with 90 WGS samples (17.47 Tera-bases), which can be aligned in an hour using 400 nodes. MICA has impressive performance even though MIC is only in its initial stage of development. MICA's source code is freely available at http://sourceforge.net/projects/mica-aligner under GPL v3. Supplementary information is available as "Additional File 1". Datasets are available at www.bio8.cs.hku.hk/dataset/mica.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 30 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Japan 1 3%
France 1 3%
Unknown 28 93%

Demographic breakdown

Readers by professional status Count As %
Researcher 9 30%
Student > Ph. D. Student 6 20%
Student > Bachelor 4 13%
Professor > Associate Professor 3 10%
Student > Master 3 10%
Other 4 13%
Unknown 1 3%
Readers by discipline Count As %
Computer Science 10 33%
Engineering 8 27%
Agricultural and Biological Sciences 5 17%
Biochemistry, Genetics and Molecular Biology 3 10%
Business, Management and Accounting 1 3%
Other 2 7%
Unknown 1 3%