Issue No. 08 - August 1998 (vol. 20)
pp: 889-895
ABSTRACT
<p>Ordered, labeled trees are trees in which each node has a label and the left-to-right order of its children (if it has any) is fixed. Such trees have many applications in vision, pattern recognition, molecular biology, and natural language processing. We consider a substructure of an ordered labeled tree <i>T</i> to be a connected subgraph of <i>T</i>. Given two ordered labeled trees <i>T</i><sub>1</sub> and <i>T</i><sub>2</sub> and an integer <i>d</i>, the largest approximately common substructure problem is to find a substructure <i>U</i><sub>1</sub> of <i>T</i><sub>1</sub> and a substructure <i>U</i><sub>2</sub> of <i>T</i><sub>2</sub> such that <i>U</i><sub>1</sub> is within edit distance <i>d</i> of <i>U</i><sub>2</sub> and there does not exist any other pair of substructures <i>V</i><sub>1</sub> of <i>T</i><sub>1</sub> and <i>V</i><sub>2</sub> of <i>T</i><sub>2</sub> such that <i>V</i><sub>1</sub> and <i>V</i><sub>2</sub> satisfy the distance constraint and the sum of the sizes of <i>V</i><sub>1</sub> and <i>V</i><sub>2</sub> is greater than the sum of the sizes of <i>U</i><sub>1</sub> and <i>U</i><sub>2</sub>. We present a dynamic programming algorithm to solve this problem. When the distance allowed in the common substructures is a constant independent of the input trees, the algorithm runs as fast as the fastest known algorithm for computing the edit distance of two trees. To demonstrate the utility of our algorithm, we discuss its application to discovering motifs in multiple RNA secondary structures (which are ordered labeled trees).</p>
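The problem statement in the abstract can be restated compactly as a constrained maximization. This is only a sketch of the definition as worded above, not notation taken from the paper: the symbol \delta for the tree edit distance and |U| for the size of a substructure are assumptions introduced here for illustration, and the abstract does not say how size is measured (the number of nodes is a natural reading).

```latex
\begin{aligned}
\text{maximize}\quad   & |U_1| + |U_2| \\
\text{subject to}\quad & U_1 \text{ is a connected subgraph of } T_1, \\
                       & U_2 \text{ is a connected subgraph of } T_2, \\
                       & \delta(U_1, U_2) \le d .
\end{aligned}
```

Under this reading, setting d = 0 asks for the largest pair of substructures at edit distance zero, while larger d admits pairs that differ by up to d edit operations.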
INDEX TERMS
Computational biology, dynamic programming, pattern matching, pattern recognition, trees.
CITATION
Jason T.L. Wang, Bruce A. Shapiro, Dennis Shasha, Kaizhong Zhang, Kathleen M. Currey, "An Algorithm for Finding the Largest Approximately Common Substructures of Two Trees", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.20, no. 8, pp. 889-895, August 1998, doi:10.1109/34.709622