IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 6, Issue No. 04, October-December 2009
Changjin Hong , University of Minnesota, Minneapolis
Ahmed H. Tewfik , University of Minnesota, Minneapolis
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2009.30
Recomputing previously evaluated similarity results between biological sequences becomes inevitable when researchers discover errors in their sequenced data or have to compare highly similar sequences, e.g., within a family of proteins. We present an efficient scheme for updating local sequence alignments under an affine gap model. In principle, using the previous matching result between two amino acid sequences, we perform a forward-backward alignment to generate heuristic search bands that are bounded by a set of suboptimal paths. Given a correctly updated sequence, we first predict a new alignment-path score for each contour and select the best candidates among them. We then run the Smith-Waterman algorithm in this confined space. Furthermore, our heuristic alignment for an updated sequence can be accelerated further by using reusable dynamic programming (rDP), our prior work. In this study, we successfully validate the "relative node tolerance bound" (RNTB) in the pruned search space. We also improve computational performance by quantifying the probability of successful RNTB tolerance and switching to rDP on perturbation-resilient columns only. In a search space derived with a threshold of 90 percent of the optimal alignment score, we find that 98.3 percent of contours contain correctly updated paths. We also find that our method consumes only 25.36 percent of the runtime of the sparse dynamic programming (sDP) method and only 2.55 percent of that of standard dynamic programming with the Smith-Waterman algorithm.
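To make the idea of running Smith-Waterman in a confined space concrete, the following is a minimal Python sketch of local alignment scoring with affine gaps (the standard Gotoh recurrences), optionally restricted to a diagonal band. It is not the authors' method: the function name `sw_affine`, the simple match/mismatch scoring, the gap parameters, and the fixed diagonal band (standing in for the contour-bounded search bands described above) are all assumptions made for illustration, and rDP and RNTB are not modeled.

```python
NEG = float("-inf")

def sw_affine(a, b, match=2, mismatch=-1, gap_open=-3, gap_extend=-1, band=None):
    """Best local alignment score with affine gaps (Gotoh recurrences).

    gap_open is the score of the first gapped position, gap_extend the score
    of each further one (illustrative values, not taken from the paper).
    band is an optional (lo, hi) pair bounding the diagonal index j - i;
    cells outside the band are skipped, which confines the search space.
    """
    n, m = len(a), len(b)
    # H: best score of an alignment ending at (i, j)
    # E: best score ending with a gap that consumes b[j-1] (horizontal move)
    # F: best score ending with a gap that consumes a[i-1] (vertical move)
    H = [[0] * (m + 1) for _ in range(n + 1)]
    E = [[NEG] * (m + 1) for _ in range(n + 1)]
    F = [[NEG] * (m + 1) for _ in range(n + 1)]
    best = 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if band is not None and not (band[0] <= j - i <= band[1]):
                continue  # pruned cell: H stays 0, E and F stay -inf
            E[i][j] = max(E[i][j - 1] + gap_extend, H[i][j - 1] + gap_open)
            F[i][j] = max(F[i - 1][j] + gap_extend, H[i - 1][j] + gap_open)
            s = match if a[i - 1] == b[j - 1] else mismatch
            H[i][j] = max(0, H[i - 1][j - 1] + s, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best

# Hypothetical toy sequences: the best alignment lies on diagonal j - i = 2.
full = sw_affine("AAAA", "TTAAAA")                  # 8: four matches
on_band = sw_affine("AAAA", "TTAAAA", band=(2, 2))  # 8: band contains the path
off_band = sw_affine("AAAA", "TTAAAA", band=(0, 0)) # 4: band misses the path
```

Because a pruned cell behaves like a fresh start in local alignment, the banded score can never exceed the full score; it matches the full score only when the band contains an optimal path, which is why the choice of candidate bands matters.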
Shortest path, minimum spanning tree, sensitivity analysis, dynamic programming, sequence alignment, string edit, suboptimal paths.
Changjin Hong, Ahmed H. Tewfik, "Heuristic Reusable Dynamic Programming: Efficient Updates of Local Sequence Alignment", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol.6, no. 4, pp. 570-582, October-December 2009, doi:10.1109/TCBB.2009.30