IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 35, no. 6, June 2013

pp: 1464-1479

Pramod K. Vemulapalli, The Pennsylvania State University, State College

Vishal Monga, The Pennsylvania State University, State College

Sean N. Brennan, The Pennsylvania State University, State College

ABSTRACT

The extraction of robust features for comparing and analyzing time series is a fundamentally important problem. Research efforts in this area encompass dimensionality reduction using popular signal analysis tools such as the discrete Fourier and wavelet transforms, various distance metrics, and the extraction of interest points from time series. Recently, extrema features for analysis of time-series data have assumed increasing significance because of their natural robustness under a variety of practical distortions, their economy of representation, and their computational benefits. Invariably, the process of encoding extrema features is preceded by filtering of the time series with an intuitively motivated filter (e.g., for smoothing), and subsequent thresholding to identify robust extrema. We define the properties of robustness, uniqueness, and cardinality as a means to identify the design choices available in each step of the feature generation process. Unlike existing methods, which use filters “inspired” by either domain knowledge or intuition, we explicitly optimize the filter on training time series to maximize the robustness of the extracted extrema features. We further demonstrate that the underlying filter optimization problem reduces to an eigenvalue problem and has a tractable solution. An encoding technique that enhances control over cardinality and uniqueness is also presented. Experimental results obtained for the problem of time series subsequence matching establish the merits of the proposed algorithm.
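As a rough illustration of the generic filter-then-threshold pipeline mentioned above, the following Python sketch smooths a series with a fixed kernel, locates strict local extrema of the filtered signal, and keeps only those whose magnitude clears a threshold. It is not the authors' optimized filter or their encoding scheme: the function name `extract_extrema`, the triangular kernel, and the magnitude-based threshold are all assumptions chosen for illustration.

```python
import numpy as np


def extract_extrema(series, kernel, threshold):
    """Filter a series, then return strict local extrema above a magnitude threshold.

    Illustrative sketch only: the kernel is supplied by the caller (hypothetical
    choice), whereas the paper learns its filter from training data.
    """
    series = np.asarray(series, dtype=float)
    kernel = np.asarray(kernel, dtype=float)

    # Step 1: filter the series (zero-padded, output has the input's length).
    filtered = np.convolve(series, kernel, mode="same")

    # Step 2: find strict local extrema via sign changes of the first difference.
    d = np.diff(filtered)
    is_max = (d[:-1] > 0) & (d[1:] < 0)
    is_min = (d[:-1] < 0) & (d[1:] > 0)
    idx = np.flatnonzero(is_max | is_min) + 1  # shift to index into `filtered`

    # Step 3: threshold, keeping only extrema with large filtered magnitude.
    strong = np.abs(filtered[idx]) >= threshold
    return idx[strong], filtered[idx][strong]


# Usage: a sinusoid with period 20 samples, smoothed by a triangular kernel.
t = np.arange(60)
signal = np.sin(2 * np.pi * t / 20)
positions, values = extract_extrema(signal, [0.25, 0.5, 0.25], threshold=0.5)
print(positions, values)
```

Raising `threshold` reduces the number of retained extrema, which is one simple way to trade cardinality against robustness; how the paper controls that trade-off is not reproduced here.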

INDEX TERMS

Robustness, Feature extraction, Vectors, Time series analysis, Optimization, Encoding, Noise, Extrema features, Time series, Pattern recognition

CITATION

Pramod K. Vemulapalli, Vishal Monga, Sean N. Brennan, "Robust Extrema Features for Time-Series Data Analysis",

*IEEE Transactions on Pattern Analysis & Machine Intelligence*, vol. 35, no. 6, pp. 1464-1479, June 2013, doi:10.1109/TPAMI.2012.216