The Community for Technology Leaders
RSS Icon
Subscribe
Heidelberg, Germany
Apr. 2, 2001 to Apr. 6, 2001
ISBN: 0-7695-1001-9
pp: 0607
Sang-Wook Kim , Kangwon National University
Sanghyun Park , University of California, Los Angeles
Wesley W. Chu , University of California, Los Angeles
ABSTRACT
Abstract: This paper proposes a new novel method for similarity search that supports time warping in large sequence databases. Time warping enables finding sequences with similar patterns even when they are of different lengths. Previous methods for processing similarity search that supports time warping fail to employ multi-dimensional indexes without false dismissal since the time warping distance does not satisfy the triangular inequality. Our primary goal is to innovate on search performance without permitting any false dismissal. To attain this goal, we devise a new distance function D_{tw-lb} that consistently underestimates the time warping distance and also satisfies the triangular inequality. D_{tw-lb} uses a 4-tuple feature vector that is extracted from each sequence and is invariant to time warping. For efficient processing of similarity search, we employ a multi-dimensional index that uses the 4-tuple feature vector as indexing attributes and D_{tw-lb} as distance function. The extensive experimental results reveal that our method achieves significant speedup up to 43 times with real-world S&P 500 stock data and up to 720 times with very large synthetic data.
CITATION
Sang-Wook Kim, Sanghyun Park, Wesley W. Chu, "An Index-Based Approach for Similarity Search Supporting Time Warping in Large Sequence Databases", ICDE, 2001, 2013 IEEE 29th International Conference on Data Engineering (ICDE), 2013 IEEE 29th International Conference on Data Engineering (ICDE) 2001, pp. 0607, doi:10.1109/ICDE.2001.914875
45 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool