The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.01 - Jan. (2014 vol.26)
pp: 144-156
Wenjing Zhang , Marquette University, Milwaukee
Xin Feng , Marquette University, Milwaukee
ABSTRACT
The new method proposed in this paper applies a multivariate reconstructed phase space (MRPS) for identifying multivariate temporal patterns that are characteristic and predictive of anomalies or events in a dynamic data system. The new method extends the original univariate reconstructed phase space framework, which is based on fuzzy unsupervised clustering method, by incorporating a new mechanism of data categorization based on the definition of events. In addition to modeling temporal dynamics in a multivariate phase space, a Bayesian approach is applied to model the first-order Markov behavior in the multidimensional data sequences. The method utilizes an exponential loss objective function to optimize a hybrid classifier which consists of a radial basis kernel function and a log-odds ratio component. We performed experimental evaluation on three data sets to demonstrate the feasibility and effectiveness of the proposed approach.
INDEX TERMS
Materials requirements planning, Data systems, Optimization, Linear programming, Delay effects, Vectors, Euclidean distance,dynamic data system, Temporal pattern, reconstructed phase space, Gaussian mixture models, optimization
CITATION
Wenjing Zhang, Xin Feng, "Event Characterization and Prediction Based on Temporal Patterns in Dynamic Data System", IEEE Transactions on Knowledge & Data Engineering, vol.26, no. 1, pp. 144-156, Jan. 2014, doi:10.1109/TKDE.2013.60
REFERENCES
[1] H. Kantz and T. Schreiber, Nonlinear Time Series Analysis. Cambridge Univ. Press, 1997.
[2] E. Keogh, S. Chu, D. Hart, and M. Pazzani, "An Online Algorithm for Segmenting Time Series," Proc. IEEE Int'l Conf. Data Mining, pp. 289-296, 2001.
[3] H. Wang, W. Fan, P.S. Yu, and J. Han, "Mining Concept-Drifting Data Streams Using Ensemble Classifiers," Proc. Ninth ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, pp. 226-235, 2003.
[4] H. Wang, J. Yin, J. Pei, P.S. Yu, and J.X. Yu, "Suppressing Model Overfitting in Mining Concept-Drifting Data Streams," Proc. 12th ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, pp. 736-741, 2006.
[5] X. Gu and H. Wang, "Online Anomaly Prediction for Robust Cluster Systems," Proc. IEEE Int'l Conf. Data Eng., pp. 1000-1011, 2009.
[6] J. Lin, E.J. Keogh, L. Wei, and S. Lonardi, "Experiencing SAX: a Novel Symbolic Representation of Time Series," Data Mining and Knowledge Discovery, vol. 15, no. 2, pp. 107-144, 2007.
[7] X. Feng and H. Huang, "A Fuzzy-Set-Based Reconstruction Phase Space Method for Identification of Temporal Patterns in Complex Time Series," IEEE Trans. Knowledge and Data Eng., vol. 17, no. 5, pp. 601-613, May 2005.
[8] C.-H. Lee, A. Liu, and W.-S. Chen, "Pattern Discovery of Fuzzy Time Series for Financial Prediction," IEEE Trans. Knowledge and Data Eng., vol. 18, no. 5, pp. 613-625, May 2006.
[9] R.J. Povinelli and X. Feng, "A New Temporal Pattern Identification Method for Characterization and Prediction of Complex Time Series Events," IEEE Trans. Knowledge and Data Eng., vol. 15, no. 2, pp. 339-352, Feb. 2003.
[10] R.J. Povinelli, M.T. Johnson, A.C. Lindgren, and J. Ye, "Time Series Classification Using Gaussian Mixture Models of Reconstructed Phase Spaces," IEEE Trans. Knowledge and Data Eng., vol. 16, no. 6, pp. 779-783, June 2004.
[11] A. Nanopoulos, R. Alcock, and Y. Manolopoulos, "Feature-Based Classification of Time-Series Data," Int'l J. Computer Research, pp. 49-61, 2001.
[12] J.J. Rodriguez and C.J. Alonso, "Interval and Dynamic Time Warping-Based Decision Trees," Proc. ACM Symp. Applied Computing, pp. 548-552, 2004.
[13] C. Faloutsos, M. Ranganathan, and Y. Manolopoulos, "Fast Subsequence Matching in Time-Series Databases," Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 419-429, 1994.
[14] K. Chan and A. Fu, "Efficient Time Series Matching by Wavelets," Proc. IEEE Int'l Conf. Data Eng., pp. 126-133, 1999.
[15] X. Feng and O. Senyana, "Mining Multiple Temporal Patterns of Complex Dynamic Data Systems," Proc. IEEE Symp. Computational Intelligence and Data Mining, pp. 411-417, 2009.
[16] A.G. Capodaglio, H. Jones, V. Novotny, and X. Feng, "Sludge Bulking Analysis and Forecasting: Application of System Identification and Artificial Neural Computing Technologies," Water Research, vol. 25, no. 10, pp. 1217-1224, 1991.
[17] D.B. Percival and A.T. Walden, Wavelet Methods for Time Series Analysis. Cambridge Univ. Press, 2000.
[18] F. Takens, "Detecting Strange Attractors in Turbulence," Dynamical Systems and Turbulence. Springer, 1981.
[19] M.B. Kennel, R. Brown, and H.D. Abarbanel, "Determining Embedding Dimension for Phase-Space Reconstruction Using a Geometrical Construction," Physical Rev. vol. 45, no. 6 pp. 3403-3411, 1992.
[20] T. Sauer, J.A. Yorke, and M. Casdagli, "Embedology," J. Statistical Physics, vol. 65, pp. 579-616, 1991.
[21] J. Iwanski and E. Bradley, "Recurrence Plot Analysis: To Embed or Not to Embed," Chaos, vol. 8, pp. 861-871, 1998.
[22] T.K. Moon, "The Expectation-Maximization Algorithm," IEEE Signal Processing Magazine, vol. 13, no. 6, pp. 47-59, Nov. 1996.
[23] A. Dempster, N. Laird, and D. Rubin, "Maximum Likelihood from Incomplete Data via the EM Algorithm," J. Royal Statistical Soc., Series B, vol. 39, no. 1, pp. 1-38, 1977.
[24] J. Friedman, T. Hastie, and R. Tibshirani, "Additive Logistic Regression: A Statistical View of Boosting," The Annals of Statistics, vol. 28, no. 2, pp. 337-407, 2000.
[25] G. Box and G. Jenkins, Time Series Analysis: Forecasting and Control. Holden-Day, 1976.
[26] Y. Quilfen, A. Bentamy, P. Delecluse, K. Katsaros, and N. Grima, "Prediction of Sea Level Anomalies Using Ocean Circulation Model Forced by Scatterometer Wind and Validation Using TOPEX/Poseidon Data," IEEE Trans. Geoscience and Remote Sensing, vol. 38, no. 4, pp. 1871-1884, July 2000.
[27] D. Sciamarella and G. Mindlin, "Unveiling the Topological Structure of Chaotic Flows from Data," Physical Rev. E, vol. 64, article 036209, 2001.
[28] L. Pritchett, "Understanding Patterns of Economic Growth: Searching for Hills among Plateaus, Mountains, and Plains," World Bank Economic Rev., vol. 14, no. 2, pp. 221-250, 2000.
[29] A.M. Fraser and H.L. Swinney, "Independent Coordinates for Strange Attractors from Mutual Information," Physical Rev. vol. 33, no. 2, pp. 1134-1146, 1986.
[30] K. Sternickel, "Automatic Pattern Recognition in ECG Time Series," Computer Methods and Programs in Biomedicine, vol. 68, no. 2, pp. 109-115, 2002.
[31] J. Nocedal and S.J. Wright, Numerical Optimization. Springer, 2006.
[32] J.V. Hansen and R.D. Nelson, "Neural Networks and Traditional Time Series Methods: A Synergistic Combination in State Economic Forecasts," IEEE Trans. Neural Networks, vol. 8, no. 4, pp. 863-873, July 1997.
[33] C.M. Bishop, Pattern Recognition and Machine Learning. Springer, 2007.
[34] C.C. Aggarwal, J. Han, J. Wang, and P.S. Yu, "A Framework for Clustering Evolving Data Streams," Proc. 29th Int'l Conf. Very Large Data Bases, pp. 81-92, 2003.
[35] J. Yang and J. Leskovec, "Patterns of Temporal Variation in Online Media," Proc. Fourth ACM WSDM Int'l Conf. Web Search and Data Mining, pp. 177-186, 2011.
[36] K. Sim, A.K. Poernomo, and V. Gopalkrishnan, "Mining Actionable Subspace Clusters in Sequential Data," Proc. 10th SIAM Int'l Conf. Data Mining, pp. 442-453, 2010.
[37] J. Gama, Knowledge Discovery from Data Streams. Chapman & Hall/CRC, 2010.
[38] L. Zhao and M.J. Zaki, "TriCluster: An Effective Algorithm for Mining Coherent Clusters in 3D Microarray Data," Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 694-705, 2005.
[39] W. Zhang, X. Feng, and N. Bansal, "Detecting Temporal Patterns Using RPS and SVM in the Dynamic Data Systems," Proc. IEEE Int'l Conf. Information and Automation, pp. 209-214, 2011.
54 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool