Stanford, CA, USA
Aug. 11, 2003 to Aug. 14, 2003
Matthias H?chsmann , University of Bielefeld
Thomas T?ller , University of Bielefeld
Robert Giegerich , University of Bielefeld
Stefan Kurtz , University of Bielefeld
We present a systematic treatment of alignment distance and local similarity algorithms on trees and forests. We build upon the tree alignment algorithm for ordered trees given by Jiang et. al (1995) and extend it to calculate local forest alignments, which is essential for finding local similar regions in RNA secondary structures. The time complexity of our algorithm is O(|F<sub>1</sub>| \cdot |F<sub>2</sub> \cdot deg(F<sub>1</sub>) \cdot deg(F<sub>2</sub>) \cdot (deg(F<sub>1</sub>) + deg(F<sub>2</sub>)) where |F<sub>i</sub>| is the number of nodes in forest F<sub>i</sub> and deg (F<sub>i</sub>) is the degree of F<sub>i</sub>. We provide carefully engineered dynamic programming implementations using dense, two-dimensional tables which considerably reduces the space requirement. We suggest a new representation of RNA secondary structures as forests that allow reasonable scoring of edit operations on RNA secondary structures. The comparison of RNA secondary structures is facilitated by a new visualization technique for RNA secondary structure alignments. Finally, we show how potential regulatory motifs can be discovered solely by their structural preservation, and independent of their sequence conservation and position.
Matthias H?chsmann, Thomas T?ller, Robert Giegerich, Stefan Kurtz, "Local Similarity in RNA Secondary Structures", CSB, 2003, Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003, Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003 2003, pp. 159, doi:10.1109/CSB.2003.1227315