A VLSI Architecture for Approximate Tree Matching
March 1998 (vol. 47 no. 3)
pp. 346-352

Abstract: The distance between two labeled ordered trees, α and β, is the cost of the minimum-cost sequence of editing operations (insertions, deletions, and substitutions) needed to transform α into β such that the predecessor-descendant relation between nodes and the ordering of nodes are not changed. Approximate tree matching has applications in genetic sequence comparison, scene analysis, error recovery and correction in programming languages, and cluster analysis. Computing the edit distance is computationally intensive, and special-purpose hardware could provide a significant speedup. This paper proposes a VLSI architecture for computing the distance between ordered h-ary trees, as well as arbitrary ordered trees. This is the first special-purpose architecture proposed for this important problem. The architecture is a parallel realization of a dynamic programming algorithm; it uses simple basic cells and requires only regular nearest-neighbor communication. The architecture has been simulated and verified using the Cadence design tools.
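
To make the distance definition concrete, the following is a minimal software sketch of an ordered-tree edit distance computed by memoized recursion over forests. It is an illustration only, not the paper's architecture and not necessarily the dynamic programming algorithm the hardware realizes. The tuple representation of trees, the function name `tree_edit_distance`, and the unit cost for every insertion, deletion, and substitution are all assumptions made for this example.

```python
from functools import lru_cache

# Assumed representation (not from the paper):
#   a tree is (label, children), where children is a tuple of trees;
#   a forest is a tuple of trees in left-to-right order.

def tree_edit_distance(a, b):
    """Edit distance between ordered labeled trees a and b, unit costs assumed."""

    @lru_cache(maxsize=None)
    def dist(f, g):
        # Both forests empty: nothing left to edit.
        if not f and not g:
            return 0
        # Only f has nodes: delete its rightmost root; the children of
        # that root take its place, so ancestry and order are preserved.
        if not g:
            _, kids = f[-1]
            return dist(f[:-1] + kids, g) + 1
        # Only g has nodes: insert its rightmost root.
        if not f:
            _, kids = g[-1]
            return dist(f, g[:-1] + kids) + 1

        f_label, f_kids = f[-1]
        g_label, g_kids = g[-1]

        # Option 1: delete the rightmost root of f.
        delete = dist(f[:-1] + f_kids, g) + 1
        # Option 2: insert the rightmost root of g.
        insert = dist(f, g[:-1] + g_kids) + 1
        # Option 3: map the two rightmost roots onto each other, then
        # match their child forests and the remaining forests separately.
        relabel = 0 if f_label == g_label else 1
        substitute = dist(f_kids, g_kids) + dist(f[:-1], g[:-1]) + relabel

        return min(delete, insert, substitute)

    return dist((a,), (b,))


if __name__ == "__main__":
    alpha = ("a", (("b", ()), ("c", ())))
    beta = ("a", (("b", ()), ("d", ())))
    print(tree_edit_distance(alpha, beta))
```

In this sketch each distinct pair of sub-forests is evaluated once and cached, which is the same table-filling idea that a parallel dynamic programming realization would spread across an array of cells. The recursion is intended for small trees; deep inputs may exceed Python's default recursion limit.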

Index Terms:
Trees, pattern matching, systolic algorithm, editing distance, Very Large Scale Integration (VLSI), special purpose hardware.
Raghu Sastry, N. Ranganathan, "A VLSI Architecture for Approximate Tree Matching," IEEE Transactions on Computers, vol. 47, no. 3, pp. 346-352, March 1998, doi:10.1109/12.660171