Haar Wavelets for Efficient Similarity Search of Time-Series: With and Without Time Warping
May/June 2003 (vol. 15 no. 3)
pp. 686-705

Abstract: We address time series search under two important distance definitions: Euclidean distance and time warping distance. The conventional method reduces dimensionality by means of the Discrete Fourier Transform. We instead apply the Haar Wavelet Transform and propose a proper normalization so that the method guarantees no false dismissals for Euclidean distance. Our experiments show that this method performs competitively. Euclidean distance, however, cannot handle time shifts of patterns: it fails to match the same rise and fall patterns in sequences with different scales. Time warping distance is a measure that handles this problem, but it is expensive to compute. Moreover, because time warping distance is not a metric, most indexing techniques cannot guarantee the absence of false dismissals. We propose efficient strategies to mitigate these problems. We suggest a Haar wavelet-based approximation function for time warping distance, called Low Resolution Time Warping, which reduces computation by trading off a small amount of accuracy. We apply our approximation function to similarity search in time series databases and show experimentally that it is highly effective in suppressing false alarms.
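
The two ideas in the abstract can be illustrated with a minimal sketch; it is not the authors' implementation. It assumes the "proper normalization" is the orthonormal scaling by 1/sqrt(2) at each level, which preserves Euclidean distance, so a distance computed on the first k Haar coefficients never exceeds the true distance and a filter based on it cannot cause false dismissals. The function names (`haar_transform`, `prefix_distance`, `dtw_distance`) are hypothetical, and the time warping distance shown is the common squared-cost dynamic programming formulation, which may differ from the paper's exact definition. Low Resolution Time Warping itself is not reproduced here.

```python
import math


def haar_transform(x):
    """Orthonormal Haar transform; len(x) must be a power of two.

    Output order: overall (scaled) average first, then detail
    coefficients from coarsest to finest.
    """
    n = len(x)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    coeffs = [float(v) for v in x]
    details = []
    root2 = math.sqrt(2.0)
    while len(coeffs) > 1:
        pairs = range(0, len(coeffs), 2)
        avg = [(coeffs[i] + coeffs[i + 1]) / root2 for i in pairs]
        diff = [(coeffs[i] - coeffs[i + 1]) / root2 for i in pairs]
        details = diff + details  # coarser details go in front
        coeffs = avg
    return coeffs + details


def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))


def prefix_distance(a, b, k):
    """Distance on the first k Haar coefficients (a lower bound)."""
    return euclidean(haar_transform(a)[:k], haar_transform(b)[:k])


def dtw_distance(a, b):
    """Time warping distance by dynamic programming, O(len(a) * len(b)).

    Assumption: squared pointwise cost, square root of the total.
    """
    inf = float("inf")
    prev = [0.0] + [inf] * len(b)  # row 0 of the cost table
    for x in a:
        cur = [inf]
        for j, y in enumerate(b, start=1):
            cost = (x - y) ** 2
            cur.append(cost + min(prev[j], cur[j - 1], prev[j - 1]))
        prev = cur
    return math.sqrt(prev[-1])


if __name__ == "__main__":
    a = [1, 3, 5, 7, 7, 5, 3, 1]
    b = [1, 1, 3, 5, 7, 7, 5, 3]  # same shape, shifted by one step
    print("Euclidean distance:", euclidean(a, b))
    for k in (1, 2, 4, 8):
        print("first", k, "Haar coefficients:", prefix_distance(a, b, k))
    print("time warping distance:", dtw_distance(a, b))
```

In this sketch the prefix distance grows toward the true Euclidean distance as k increases, while the time warping distance on the shifted pair comes out smaller than the Euclidean distance, which is the behaviour the abstract describes for shifted patterns.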

Index Terms:
Similarity search, time warping, wavelets, dimension reduction, multidimensional index, time series database, data mining.
Citation:
Franky Kin-Pong Chan, Ada Wai-chee Fu, Clement Yu, "Haar Wavelets for Efficient Similarity Search of Time-Series: With and Without Time Warping," IEEE Transactions on Knowledge and Data Engineering, vol. 15, no. 3, pp. 686-705, May-June 2003, doi:10.1109/TKDE.2003.1198399