The Community for Technology Leaders
Parallel Architectures, Algorithms and Programming, International Symposium on (2011)
Tianjin, China
Dec. 9, 2011 to Dec. 11, 2011
ISBN: 978-0-7695-4575-2
pp: 241-246
ABSTRACT
Sequence alignment is of great importance in biology research. BLAST is a sequence alignment tool used extensively by researchers. However the continuously increasing amount of sequence data to be processed presents many challenges to it. This paper gives a simple and effective approach to parallelizing BLAST using the MapReduce technique. The MapReduce-BLAST shows very good performance and scales nearly linearly to the database size and query length. This results from both the power of MapReduce and the inherent parallel characteristics of the BLAST algorithm. Sequence alignment algorithms based on techniques similar with BLAST's seed-and-extend approach are very suitable for being parallelized with MapReduce.
INDEX TERMS
BLAST, parallelization, MapReduce, Hadoop, long sequence alignment
CITATION
Yi-hua Huang, Xiao-liang Yang, Yu-long Liu, Chun-feng Yuan, "Parallelization of BLAST with MapReduce for Long Sequence Alignment", Parallel Architectures, Algorithms and Programming, International Symposium on, vol. 00, no. , pp. 241-246, 2011, doi:10.1109/PAAP.2011.36
247 ms
(Ver 3.3 (11022016))