CSDL Home IEEE/ACM Transactions on Computational Biology and Bioinformatics 2008 vol.5 Issue No.04 - October-December

Subscribe

Issue No.04 - October-December (2008 vol.5)

pp: 503-513

Antoni Lozano , Technical University of Catalonia, Barcelona

Ron Y. Pinter , Technion - Israel Institute of Technology, Haifa

Oleg Rokhlenko , Technion - Israel Institute of Technology, Haifa

Gabriel Valiente , Technical University of Catalonia, Barcelona

Michal Ziv-Ukelson , Ben Gurion University of the Negev, Beer-Sheva

DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2008.59

ABSTRACT

The optimal transformation of one tree into another by means of elementary edit operations is an important algorithmic problem that has several interesting applications to computational biology. Here we introduce a constrained form of this problem in which a partial mapping of a set of nodes (the "seeds") in one tree to a corresponding set of nodes in the other tree is given, and present efficient algorithms for both ordered and unordered trees. Whereas ordered tree matching based on seeded nodes has applications in pattern matching of RNA structures, unordered tree matching based on seeded nodes has applications in co-speciation and phylogeny reconciliation. The latter involves the solution of the planar tanglegram layout problem, for which a polynomial-time algorithm is given here.

INDEX TERMS

Graph algorithms, Graph Theory, Discrete Mathematics, Mathematics of Computing, Trees, Biology and genetics, Life and Medical Sciences, Computer Applications

CITATION

Antoni Lozano, Ron Y. Pinter, Oleg Rokhlenko, Gabriel Valiente, Michal Ziv-Ukelson, "Seeded Tree Alignment",

*IEEE/ACM Transactions on Computational Biology and Bioinformatics*, vol.5, no. 4, pp. 503-513, October-December 2008, doi:10.1109/TCBB.2008.59REFERENCES

- [2] B. DasGupta, X. He, T. Jiang, M. Li, J. Tromp, and L. Zhang, “On Distances between Phylogenetic Trees,”
Proc. Eighth Ann. ACM-SIAM Symp. Discrete Algorithms (SODA '97), pp. 427-436, 1997.- [4] J. Felsenstein, “Phylip—Phylogeny Inference Package (Version3.2),”
Cladistics, vol. 5, no. 1, pp. 164-166, 1989.- [5] U. Fößmeier and M. Kaufmann, “Nice Drawings for Planar Bipartite Graphs,”
Proc. Third Italian Conf. Algorithms and Complexity (CIAC '97), vol. 1203, pp. 122-134, 1997.- [6] J.A. Gallian, “A Dynamic Survey of Graph Labeling,”
Electronic J. Combinatorics, no. DS7, http://www.combinatorics.orgSurveys/, 2007.- [7] K.J. Gardiner, T.L. Marsh, and N.R. Pace, “Ion Dependence of the Bacillus Subtilis RNase P Reaction,”
J. Biological Chemistry, vol. 260, no. 9, pp. 5415-5419, 1985.- [10] P. Hugenholtz, “Exploring Prokaryotic Diversity in the Genomic Era,”
Genome Biology, vol. 3, no. 2, pp. reviews0003.1-reviews0003.8, 2002.- [14] A. Lozano, R. Pinter, O. Rokhlenko, G. Valiente, and M. Ziv-Ukelson, “Seeded Tree Alignment and Planar Tanglegram Layout,”
Proc. Seventh Workshop Algorithms in Bioinformatics (WABI'07), vol. 4645, pp. 98-110, 2007.- [16] D.W. Matula, “An Algorithm for Subtree Identification,”
SIAM Rev., vol. 10, pp. 273-274, 1968.- [17] D. Matula, “Subtree Isomorphism in $O(n^{5/2})$ ,”
Annals Discrete Math., vol. 2, pp. 91-106, 1978.- [19] N.R. Pace and J.W. Brown, “Evolutionary Perspective on the Structure and Function of Ribonuclease P, A Ribozyme,”
J.Bacteriology, vol. 177, no. 8, pp. 1919-1928, 1995.- [20] R.D.M. Page, ed.,
Tangled Trees: Phylogeny, Cospeciation, and Coevolution. The Univ. of Chicago Press, 2002.- [21] R.D.M. Page and G. Valiente, “An Edit Script for Taxonomic Classifications,”
BMC Bioinformatics, vol. 6, p. 208, 2005.- [22] R.Y. Pinter, O. Rokhlenko, D. Tsur, and M. Ziv-Ukelson, “Approximate Labelled Subtree Homeomorphism,”
Proc. 15th Ann. Symp. Combinatorial Pattern Matching (CPM '04), vol. 3109, pp.59-73, 2004.- [26] B.A. Shapiro and K. Zhang, “Comparing Multiple RNA Secondary Structures Using Tree Comparisons,”
Computer Applications in the Biosciences, vol. 6, no. 4, pp. 309-318, 1990.- [27] D. Shasha, J.T.-L. Wang, and R. Giugno, “Algorithmics and Applications of Tree and Graph Searching,”
Proc. 21st ACM SIGACT-SIGMOD-SIGART Symp. Principles of Database Systems (PODS '02), pp. 39-52, 2002.- [28] G. Valiente,
Algorithms on Trees and Graphs. Springer, 2002.- [30] G. Valiente, “A Fast Algorithmic Technique for Comparing Large Phylogenetic Trees,”
Proc. 12th Int'l Symp. String Processing and Information Retrieval (SPIRE '05), vol. 3772, pp. 371-376, 2005.- [31] C.R. Woese and N.R. Pace, “Probing RNA Structure, Function, and History by Comparative Analysis,”
The RNA World, R.F.Gesteland and J.F. Atkins, eds., pp. 91-117, Cold Spring Harbor Laboratory Press, 1993.- [32] W.N.W. Zainon and P. Calder, “Visualizing Phylogenetic Trees,”
Proc. Seventh Australasian User Interface Conf. (AUIC'06), pp. 145-152, 2006.- [33] K. Zhang, L. Wang, and B. Ma, “Computing Similarity between RNA Structures,”
Proc. 10th Ann. Symp. Combinatorial Pattern Matching (CPM '99), vol. 1645, pp. 281-293, 1999. |