Issue No. 11 - November (2009 vol. 21)
pp. 1617-1628
Ding-An Chiang, Tamkang University, Taiwan
Cheng-Tzu Wang, National Taipei University of Education, Taiwan
Shao-Ping Chen, Tamkang University, Taiwan
Chun-Chi Chen, Tamkang University, Taiwan
Sequential pattern mining has long been used to predict various aspects of customer buying behavior. A discovered sequence reveals the chronological relation between items and provides valuable information for developing marketing strategies. However, a sequential pattern alone does not tell us whether the purchases are cyclic or how long the interval between two consecutive items in the pattern is. To address this problem, this paper combines data mining techniques with fundamental statistics to develop a set of algorithms that uncover the cyclic properties of discovered sequential patterns. Coupled with the sequential pattern mining process, these algorithms constitute a complete scheme for analyzing and predicting likely consumer behavior. The proposed algorithms are implemented and tested on real data collected from a consumer goods company. The experimental results illustrate how the model can be used to predict likely purchases within a certain time frame, so that marketing professionals can run campaigns to favorably influence customer behavior.
Association rules, data mining, frequency, sequential pattern, polynomial regression.
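To make the question raised in the abstract concrete (how long is the interval between two consecutive items of a pattern, and is it regular?), the following is a minimal sketch and not the authors' algorithm. The function names, the purchase log, and the use of the coefficient of variation as a regularity indicator are all assumptions chosen for illustration; the paper's own method is not reproduced here.

```python
from datetime import date
from statistics import mean, pstdev


def pattern_intervals(purchases, first, second):
    """Days from each purchase of `first` to the next purchase of `second`.

    `purchases` is an iterable of (date, item) pairs for one customer.
    This pairing rule is an illustrative assumption, not the paper's definition.
    """
    intervals = []
    pending = None  # date of the most recent unmatched `first` purchase
    for day, item in sorted(purchases):
        if item == first:
            pending = day
        elif item == second and pending is not None:
            intervals.append((day - pending).days)
            pending = None
    return intervals


def interval_summary(intervals):
    """Return mean gap, population standard deviation, and their ratio."""
    avg = mean(intervals)
    spread = pstdev(intervals)
    return avg, spread, spread / avg


# Hypothetical purchase log for a single customer.
log = [
    (date(2009, 1, 1), "shampoo"),
    (date(2009, 1, 29), "conditioner"),
    (date(2009, 3, 1), "shampoo"),
    (date(2009, 3, 31), "conditioner"),
    (date(2009, 5, 1), "shampoo"),
    (date(2009, 5, 30), "conditioner"),
]

gaps = pattern_intervals(log, "shampoo", "conditioner")
avg, spread, cv = interval_summary(gaps)
print(gaps, avg, round(cv, 3))
```

A small ratio of spread to mean suggests that the gap between the two items is fairly stable for this customer, which is the kind of regularity a marketer could use to time a campaign. Any threshold for calling a pattern "cyclic" would have to be chosen separately.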
Ding-An Chiang, Cheng-Tzu Wang, Shao-Ping Chen, Chun-Chi Chen, "The Cyclic Model Analysis on Sequential Patterns", IEEE Transactions on Knowledge & Data Engineering, vol.21, no. 11, pp. 1617-1628, November 2009, doi:10.1109/TKDE.2009.36