2009 IEEE International Conference on Bioinformatics and Biomedicine (2009)
Washington, D.C., USA
Nov. 1, 2009 to Nov. 4, 2009
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/BIBM.2009.41
We consider the problem of identifying motifs, recurring or conserved patterns, in the sets of biological sequences. To solve this task, we present new deterministic and exact algorithms for finding patterns that are embedded as exact or inexact instances in all or most of the input strings. The proposed algorithms (1) improve search efficiency compared to existing exact algorithms by focusing search on a selected set of potential motif instances, and (2) scale well with the input length and the size of alphabet.Our algorithms are orders of magnitude faster than existingexact algorithms for common pattern identification. We evaluate our algorithms on benchmark motif finding problemsand real applications in biological sequence analysis and show that they exhibit significant running time improvements compared to the state-of-the-art approaches.
P. Kuksa and V. Pavlovic, "Fast Motif Selection for Biological Sequences," 2009 IEEE International Conference on Bioinformatics and Biomedicine(BIBM), Washington, D.C., USA, 2009, pp. 79-82.