This Article 
 Bibliographic References 
 Add to: 
Boosting an Associative Classifier
July 2006 (vol. 18 no. 7)
pp. 988-992
Associative classification is a new classification approach integrating association mining and classification. It becomes a significant tool for knowledge discovery and data mining. However, high-order association mining is time consuming when the number of attributes becomes large. The recent development of the AdaBoost algorithm indicates that boosting simple rules could often achieve better classification results than the use of complex rules. In view of this, we apply the AdaBoost algorithm to an associative classification system for both learning time reduction and accuracy improvement. In addition to exploring many advantages of the boosted associative classification system, this paper also proposes a new weighting strategy for voting multiple classifiers.

[1] R. Agrawal and R. Srikant, “Fast Algorithms for Mining Association Rules,” Proc. Int'l Conf. Very Large Data Bases (VLDB '94), pp. 487-499, 1999.
[2] E. Baralis and P. Garza, “A Lazy Approach to Prunning Classification Rules,” Proc. 2002 IEEE Int'l Conf. Data Mining (ICDM '02), pp. 35-42, 2002.
[3] G. Dong and J. Li, “Efficient Mining of Emerging Patterns: Discovering Trends and Differences,” Proc. Fifth ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, S. Chaudhuri and D. Madigan, eds., pp. 43-52, 1999.
[4] G. Dong, X. Zhang, L. Wong, and J. Li, “CAEP: Classification by Aggregating Emerging Patterns,” Proc. Second Int'l Conf. Discovery Science (DS '99), pp. 43-55, Dec. 1999.
[5] C. Elkan, “Boosting and Naïve Bayesian Learning,” Technical Report CS97-557, Dept. Computer Science and Eng., Univ. of California, San Diego, Sept. 1997.
[6] Y. Freund and R.E. Schapire, “Experiments with a New Boosting Algorithm,” Proc. 13th Int'l Conf. Machine Learning, pp. 148-156, 1996.
[7] Y. Freund and R.E. Schapire, “A Decision-Theoretic Generalization of On-Line Learning and an Aplication to Boosting,” J. Computer and System Sciences, vol. 55, no. 1, pp. 119-139, Aug. 1997.
[8] J. Han, J. Pei, and Y. Yin, “Mining Frequent Patterns without Candidate Generation,” Proc. 2000 ACM SIGMOD Int'l Conf. Management of Data (SIGMOD '00), pp. 1-12, May 2000.
[9] R. Kohavi, D. Sommerfield, and J. Dougherty, Data Mining Using MLC++: A Machine Learning Library in C++. Tools with Artificial Intelligence. IEEE CS Press, 1996.
[10] J. Li, G. Dong, K. Ramamohanarao, and L. Wong, “DeEPs: A New Instance-Based Lazy Discovery and Classification System,” Machine Learning, vol. 54, no. 2, pp. 99-124, 2004.
[11] W. Li, J. Han, and J. Pei, “CMAR: Accurate and Efficient Classification Based on Multiple Class-Association Rules,” Proc. 2001 IEEE Int'l Conf. Data Mining (ICDM '01), pp. 369-376, Nov. 2001.
[12] B. Liu, W. Hsu, and Y. Ma, “Integrating Classification and Association Rule Mining,” Proc. Fourth ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, pp. 80-86, Aug. 1998.
[13] P.M. Murph and D.W. Aha, “UCI Repository Of Machine Learning Databases,” Dept. Information and Computer Science, Univ. of California, Irvine, 1991.
[14] J.R. Quinlan, “Bagging, Boosting, and C4.5,” Proc. 13th Nat'l Conf. Artificial Intelligence and the Eighth Innovative Applications of Artificial Intelligence Conf., pp. 715-730, Aug. 1996.
[15] R.E. Schapire, “The Boosting Approach to Machine Learning— An Overview,” Proc. MSRI Workshop Nonlinear Estimation and Classification, pp. 149-172, Mar. 2002.
[16] R.E. Schapire and Y. Singer, “Improved Boosting Algorithms Using Confidence-Rated Predictions,” Machine Learning, vol. 37, no. 3, pp. 297-336, 1999.
[17] H. Schwenk and Y. Bengio, “Boosting Neural Networks,” Neural Computation, vol. 12, no. 8, pp. 1869-1887, 2000.
[18] Y. Sun, Y. Wang, and A.K.C. Wong, “An Overview of Associative Classifiers,” technical report, Pattern Analysis and Machine Intelligence Lab, Univ. of Waterloo (http://pami/pub/sunymoverview.pdf), Waterloo, Ontario, Canada, Feb. 2006.
[19] Y. Wang, “High-Order Pattern Discovery and Analysis of Discrete-Valued Data Sets,” PhD thesis, Univ. of Waterloo, Waterloo, Ontario, Canada, 1997.
[20] Y. Wang and A.K.C. Wong, “From Association to Classification: Inference Using Weight of Evidence,” IEEE Trans. Knowledge and Data Eng., vol. 15, no. 3, pp. 764-767, 2003.
[21] A.K.C. Wong and Y. Wang, “High Order Pattern Discovery from Discrete-Valued Data,” IEEE Trans. Knowledge and Data Eng., vol. 9, no. 6, pp. 877-893, 1997.

Index Terms:
Data mining, classification, association mining, classifier design and evaluation, pattern discovery, boosting.
Yanmin Sun, Yang Wang, Andrew K.C. Wong, "Boosting an Associative Classifier," IEEE Transactions on Knowledge and Data Engineering, vol. 18, no. 7, pp. 988-992, July 2006, doi:10.1109/TKDE.2006.105
Usage of this product signifies your acceptance of the Terms of Use.