Parallel Architectures, Algorithms and Programming, International Symposium on (2011)
Dec. 9, 2011 to Dec. 11, 2011
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/PAAP.2011.36
Sequence alignment is of great importance in biology research. BLAST is a sequence alignment tool used extensively by researchers. However the continuously increasing amount of sequence data to be processed presents many challenges to it. This paper gives a simple and effective approach to parallelizing BLAST using the MapReduce technique. The MapReduce-BLAST shows very good performance and scales nearly linearly to the database size and query length. This results from both the power of MapReduce and the inherent parallel characteristics of the BLAST algorithm. Sequence alignment algorithms based on techniques similar with BLAST's seed-and-extend approach are very suitable for being parallelized with MapReduce.
BLAST, parallelization, MapReduce, Hadoop, long sequence alignment
Yi-hua Huang, Xiao-liang Yang, Yu-long Liu, Chun-feng Yuan, "Parallelization of BLAST with MapReduce for Long Sequence Alignment", Parallel Architectures, Algorithms and Programming, International Symposium on, vol. 00, no. , pp. 241-246, 2011, doi:10.1109/PAAP.2011.36