loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
16th International Conference on Data Engineering (ICDE'00)
Efficient Searches for Similar Subsequences of Different Lengths in Sequence Databases
San Diego, California
February 28-March 03
ISBN: 0-7695-0506-6
Sanghyun Park, University of California at Los Angeles
Wesley W. Chu, University of California at Los Angeles
Jeehee Yoon, Hallym University
We propose an indexing technique for fast retrieval of similar subsequences using time warping distances. A time warping distance is a more suitable similarity measure than the Euclidean distance in many applications, where sequences may be of different lengths or different sampling rates. Our indexing technique uses a disk-based suffix tree as an index structure and employs lower-bound distance functions to filter out dissimilar subsequences without false dismissals. To make the index structure compact and thus accelerate the query processing, we convert sequences of continuous values to sequences of discrete values via a categorization method and store only a subset of suffixes whose first values are different from their preceding values. The experimental results reveal that our proposed technique can be a few orders of magnitude faster than sequential scanning.
Index Terms:
Similarity Search, Sequence Database, Time Warping, Suffix Tree
Citation:
Sanghyun Park, Wesley W. Chu, Jeehee Yoon, Chihcheng Hsu, "Efficient Searches for Similar Subsequences of Different Lengths in Sequence Databases," icde, pp.23, 16th International Conference on Data Engineering (ICDE'00), 2000
Usage of this product signifies your acceptance of the Terms of Use.