Issue No. 03 - May/June (2011 vol. 8)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2010.76
Daniel G. Brown, University of Waterloo, Waterloo
Alexander K. Hudek, University of Waterloo, Waterloo
We present a pairwise local aligner, FEAST, which uses two new techniques: a sensitive extension algorithm for identifying homologous subsequences, and a descriptive probabilistic alignment model. We also present a new procedure for training alignment parameters and apply it to the human and mouse genomes, producing a better parameter set for these sequences. Our extension algorithm identifies homologous subsequences by considering all evolutionary histories. It has higher maximum sensitivity than Viterbi extensions, and better balances specificity. We model alignments with several submodels, each with unique statistical properties, describing strongly similar and weakly similar regions of homologous DNA. Training parameters using two submodels produces superior alignments, even when we align with only the parameters from the weaker submodel. Our extension algorithm combined with our new parameter set achieves sensitivity 0.59 on synthetic tests. In contrast, LASTZ with default settings achieves sensitivity 0.35 with the same false positive rate. Using the weak submodel as parameters for LASTZ increases its sensitivity to 0.59 with high error. FEAST is available at http://monod.uwaterloo.ca/feast/.
HMM, sequence evolution, local alignment, biology and genetics.
Daniel G. Brown, Alexander K. Hudek, "FEAST: Sensitive Local Alignment with Multiple Rates of Evolution", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 8, no. 3, pp. 698-709, May/June 2011, doi:10.1109/TCBB.2010.76