This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
SMCA: A General Model for Mining Asynchronous Periodic Patterns in Temporal Databases
June 2005 (vol. 17 no. 6)
pp. 774-785
Chia-Hui Chang, IEEE Computer Society
Mining periodic patterns in time series databases is an important data mining problem with many applications. Previous studies have considered synchronous periodic patterns where misaligned occurrences are not allowed. However, asynchronous periodic pattern mining has received less attention and only been discussed for a sequence of symbols where each time point contains one event. In this paper, we propose a more general model of asynchronous periodic patterns from a sequence of symbol sets where a time slot can contain multiple events. Three parameters min\_rep, max\_dis, and global\_rep are employed to specify the minimum number of repetitions required for a valid segment of nondisrupted pattern occurrences, the maximum allowed disturbance between two successive valid segments, and the total repetitions required for a valid sequence. A 4-phase algorithm is devised to discover periodic patterns from a time series database presented in vertical format. The experiments demonstrate good performance and scalability with large frequent patterns.

[1] R. Agrawal and R. Srikant, “Fast Algorithms for Mining Association Rules,” Proc. 20th Int'l Conf. Very Large Data Bases (VLDB '94), pp. 487-499, 1994.
[2] R. Agrawal and R. Srikant, “Mining Sequential Patterns,” Proc. 11th Int'l Conf. Data Eng. (ICDE '95), pp. 3-14, 1995.
[3] C. Berberidis, I. Vlahavas, W. Aref, M. Atallah, and A. Elmagarmid, “The Discovery of Weak Periodicities in Large Time Series,” Proc. European Conf. Principles and Practice of Knowledge Discovery in Databases (PKDD '02), 2002.
[4] J. Han, G. Dong, and Y. Yin, “Efficient Mining Paritial Periodic Patterns in Time Series Database,” Proc. 15th Int'l Conf. Data Eng. (ICDE '99), pp. 106-115, 1999.
[5] J. Han, W. Gong, and Y. Yin, “Mining Segment-Wise Periodic Patterns in Time-Related Databases,” Proc. Fourth ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining (KDD '98), pp. 214-218, 1998.
[6] J. Han, J. Pei, and Y. Yin, “Mining Frequent Patterns Without Candidate Generation,” Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD '00), pp. 1-12, 2000.
[7] K.Y. Huang and C.H. Chang, “Prowl: An Efficient Frequent Continuity Mining Algorithm on Event Sequences,” Proc. Sixth Int'l Conf. Data Warehousing and Knowledge Discovery (DaWaK '04), pp. 351-360, 2004.
[8] M.V. Katti, R. Sami-Subbu, P.K. Ranjekar, and V.S. Gupta, “Amino Acid Repeat Patterns in Protein Sequences: Their Diversity and Structural-Function Implications,” Protein Science, no. 9, pp. 1203-1209, 2000.
[9] S. Ma and J. Hellerstein, “Mining Partially Periodic Event Patterns with Unknown Periods,” Proc. Int'l Conf. Data Eng. (ICDE '01), pp. 205-214, 2001.
[10] H. Mannila, H. Toivonen, and A.I. Verkamo, “Discovering Frequent Episodes in Sequences,” Proc. First Int'l Conf. Knowledge Discovery and Data Mining, pp. 210-215, 1995.
[11] H. Mannila, H. Toivonen, and A.I. Verkamo, “Discovering Generalized Episodes Using Minimal Occurrences,” Proc. Second Int'l Conf. Knowledge Discovery and Data Mining, pp. 146-151, 1996.
[12] H. Mannila, H. Toivonen, and A.I. Verkamo, “Discovering Frequent Episodes in Event Sequences,” Data Mining and Knowledge Discovery, vol. 1, no. 3, pp. 259-289, 1997.
[13] B. Ozden, S. Ramaswamy, and A. Silberschatz, “Cyclic Association Rules,” Proc. 14th Int'l Conf. Data Eng. (ICDE '98), pp. 412-421, 1998.
[14] R. Srikant and R. Agrawal, “Mining Sequential Patterns: Generalizations and Performance Improvements,” Proc. Fifth Int'l Conf. Extending Database Technology (EDBT '96), pp. 3-17, 1996.
[15] A.K.H. Tung, H. Lu, J. Han, and L. Feng, “Efficient Mining of Intertransaction Association Rules,” IEEE Trans. Knowledge and Data Eng., vol. 15, no. 1, pp. 43-56, 2003.
[16] W. Wang, J. Yang, and P.S. Yu, “Mining Patterns in Long Sequential Data with Noise,” Proc. ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining (KDD '00), pp. 28-33, 2000.
[17] J. Yang, W. Wang, and P.S. Yu, “Mining Asynchronous Periodic Patterns in Time Series Data,” IEEE Trans. Knowledge and Data Eng., vol. 15, no. 3, pp. 613-628, 2003.
[18] M.J. Zaki, “Spade: An Efficient Algorithm for Mining Frequent Sequences,” Machine Learning, vol. 42, nos. 1/2, pp. 31-60, 2001.

Index Terms:
Periodic pattern, asynchronous sequence, partial periodicity, temporal database.
Citation:
Kuo-Yu Huang, Chia-Hui Chang, "SMCA: A General Model for Mining Asynchronous Periodic Patterns in Temporal Databases," IEEE Transactions on Knowledge and Data Engineering, vol. 17, no. 6, pp. 774-785, June 2005, doi:10.1109/TKDE.2005.98
Usage of this product signifies your acceptance of the Terms of Use.