This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Using Emerging Patterns to Construct Weighted Decision Trees
July 2006 (vol. 18 no. 7)
pp. 865-876
Decision trees (DTs) represent one of the most important and popular solutions to the problem of classification. They have been shown to have excellent performance in the field of data mining and machine learning. However, the problem of DTs is that they are built using data instances assigned to crisp classes. In this paper, we generalize decision trees so that they can take into account weighted classes assigned to the training data instances. Moreover, we propose a novel method for discovering weights for the training instances. Our method is based on emerging patterns (EPs). EPs are those itemsets whose supports (probabilities) in one class are significantly higher than their supports (probabilities) in the other classes. Our experimental evaluation shows that the new proposed method has good performance and excellent noise tolerance.

[1] H. Alhammady and K. Ramamohanarao, “The Application of Emerging Patterns for Improving the Quality of Rare-Class Classification,” Proc. 2004 Pacific-Asia Conf. Knowledge Discovery and Data Mining (PAKDD '04), 2004.
[2] H. Alhammady and K. Ramamohanarao, “Using Emerging Patterns and Decision Trees in Rare-Class Classification,” Proc. 2004 IEEE Int'l Conf. Data Mining (ICDM '04), 2004.
[3] H. Alhammady and K. Ramamohanarao, “Expanding the Training Data Space Using Emerging Patterns and Genetic Methods,” Proc. 2005 SIAM Int'l Conf. Data Mining (SDM '05), 2005.
[4] H. Fan and K. Ramamohanarao, “An Efficient Single-Scan Algorithm for Mining Essential Jumping Emerging Patterns for Classification,” Proc. 2002 Pacific-Asia Conf. Knowledge Discovery and Data Mining (PAKDD '02), 2002.
[5] H. Fan and K. Ramamohanarao, “Efficiently Mining Interesting Emerging Patterns,” Proc. Fourth Int'l Conf. Web-Age Information Management (WAIM '03), 2003.
[6] H. Fan and K. Ramamohanarao, “A Bayesian Approach to Use Emerging Patterns for Classification,” Proc. 14th Australasian Database Conf. (ADC '03), 2003.
[7] G. Dong and J. Li, “Efficient Mining of Emerging Patterns: Discovering Trends and Differences,” Proc. 1999 Int'l Conf. Knowledge Discovery and Data Mining (KDD '99), 1999.
[8] C. Blake, E. Keogh, and C.J. Merz, “UCI Repository of Machine Learning Databases,” technical report, Dept. of Information and Computer Science, Univ. of California at Irvine, www.ics.uci.edu/~mlearnMLRepository.html , 1999.
[9] I.H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, 1999.
[10] D. Guozhu, Z. Xiuzhen, W. Limsoon, and L. Jinyan, “CAEP: Classification by Aggregating Emerging Patterns,” Proc. Second Int'l Conf. Discovery Science (DS '99), 1999.
[11] C. Olaru and L. Wehenkel, “A Complete Fuzzy Decision Tree Technique,” J. Fuzzy Sets and Systems, vol. 138, 2003.
[12] W. Li, J. Han, and J. Pei, “CMAR: Accurate and Efficient Classification Based on Multiple Class-Association Rules,” Proc. 2001 IEEE Int'l Conf. Data Mining (ICDM '01), 2001.
[13] T.G. Dietterich, “Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms,” J. Neural Computation, vol. 10, 1998.
[14] T. Fawcett, ROC Graphs: Notes and Practical Considerations for Researchers. Kluwer Academic, 2004.
[15] C. D'Amato, F. Esposito, D. Malerba, and M. Monopoli, “Extending the K-Nearest Neighbour Classification Algorithm to Symbolic Objects,” www.di.uniba.it~malerba/, 2006.
[16] R. Kohavi, D. Sommerfield, and J. Dougherty, “Data Mining Using MLC++: A Machine Learning Library in C++,” Int'l J. Artificial Intelligence Tools, vol. 6, no. 4, pp. 537-566, 1997.
[17] U.M. Fayyad and K.B. Irani, “Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning,” Proc. 13th Int'l Joint Conf. Artificial Intelligence, 1993.
[18] J.R. Quinlan, “Induction of Decision Trees,” Machine Learning, vol. 1, no. 1, pp. 81-106, 1986.
[19] J.R. Quinlan, C4.5: Programs for Machine Learning. Morgan Kaufmann, 1993.

Index Terms:
Classification, data mining.
Citation:
Hamad Alhammady, Kotagiri Ramamohanarao, "Using Emerging Patterns to Construct Weighted Decision Trees," IEEE Transactions on Knowledge and Data Engineering, vol. 18, no. 7, pp. 865-876, July 2006, doi:10.1109/TKDE.2006.116
Usage of this product signifies your acceptance of the Terms of Use.