loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
15th International Conference on Scientific and Statistical Database Management
TreeRank: A Similarity Measure for Nearest Neighbor Searching in Phylogenetic Databases
Cambridge, Massachusetts, USA
July 09-July 11
ISBN: 0-7695-1964-4
Jason T. L. Wang, New Jersey Institute of Technology
Huiyuan Shan, New Jersey Institute of Technology
Dennis Shasha, New York University
William H. Piel, University at Buffalo
Phylogenetic trees are unordered labeled trees in which each leaf node has a label and the order among siblings is unimportant. In this paper we propose a new similarity measure, called TreeRank, for phylogenetic trees and present an algorithm for computing TreeRank scores. Given a query or pattern tree P and a data tree D, the TreeRank score from P to D is a measure of the topological relationships in P that are found to be the same or similar in D. The proposed algorithm calculates the TreeRank score in O(M2 + N) time where M is the number of nodes appearing in both P and D, and N is the number of nodes in D. We then develop a search engine that, given a query or pattern tree P and a database of trees D, finds and ranks the nearest neighbors of P in D where the "nearness" is measured by the proposed similarity function. This structure-based search engine is fully operational and is available on the World Wide Web.
Citation:
Jason T. L. Wang, Huiyuan Shan, Dennis Shasha, William H. Piel, "TreeRank: A Similarity Measure for Nearest Neighbor Searching in Phylogenetic Databases," ssdbm, pp.171, 15th International Conference on Scientific and Statistical Database Management, 2003
Usage of this product signifies your acceptance of the Terms of Use.