loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
20th International Conference on Data Engineering (ICDE'04)
Unordered Tree Mining with Applications to Phylogeny
Boston, Massachusetts
March 30-April 02
ISBN: 0-7695-2065-0
Dennis Shasha, New York University
Jason T. L. Wang, New Jersey Institute of Technology
Sen Zhang, New Jersey Institute of Technology
Frequent structure mining (FSM) aims to discover and extract patterns frequently occuring in structural data, such as trees and graphs. FSM finds many applications in bioinformatics, XML processing, Web log analysis, and so on. In this paper we present a new FSM technique for finding patterns in rooted unordered labeled trees. The patterns of interest are cousin pairs in these trees. A cousin pair is a pair of nodes sharing the same parent, the same grand-parent, or the same great-grandparent, etc. Given a tree T, our algorithm finds all interesting cousin pairs of T in O(|T|2) time when |T| is the number of nodes in T. Experimental results on synthetic data and phylogenies show the scalability and effectiveness of the proposed technique. To demonstrate the usefulness of our approach, we discuss its applications to locating co-occurring patterns in multiple evolutionary trees, evaluating the consensus of equally parsimonious trees, and finding kernel trees of groups of phylogenies. We also describe extensions of our algorithms for undirected acyclic graphs (or free trees).
Citation:
Dennis Shasha, Jason T. L. Wang, Sen Zhang, "Unordered Tree Mining with Applications to Phylogeny," icde, pp.708, 20th International Conference on Data Engineering (ICDE'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.