Evolutionary Constructive Induction
November 2005 (vol. 17 no. 11)
pp. 1518-1528
Feature construction in classification is a preprocessing step in which one or more new attributes are constructed from the original attribute set, with the aim of obtaining features that are more predictive than the original ones. Genetic programming (GP) allows the construction of nonlinear combinations of the original features. We present a comprehensive analysis of GP used for feature construction, in which four different fitness functions are used by the GP and four different classification techniques are subsequently used to build the classifier. Comparisons are made of the error rates and of the size and complexity of the resulting trees. We also compare the overall performance of GP in feature construction with that of GP used directly to evolve a decision tree classifier; the former proves to be the more effective use of the evolutionary paradigm.
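
To make the idea of scoring a constructed feature concrete, here is a minimal sketch of how a single candidate feature might be evaluated. The abstract does not name the four fitness functions, so this sketch assumes information gain over the best binary threshold split as one plausible choice. The tuple-encoded expression tree, the operator set, the helper names (`evaluate`, `fitness`), and the toy data are all illustrative assumptions, not the authors' implementation. The evolutionary loop itself (selection, crossover, mutation) is omitted.

```python
import math
from collections import Counter

# Assumed operator set for the expression trees (illustrative only).
OPS = {
    "+": lambda a, b: a + b,
    "-": lambda a, b: a - b,
    "*": lambda a, b: a * b,
}

def entropy(labels):
    """Shannon entropy, in bits, of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())

def information_gain(values, labels):
    """Best information gain over all binary threshold splits of one numeric feature."""
    base = entropy(labels)
    pairs = sorted(zip(values, labels))
    best = 0.0
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold can separate equal values
        left = [lab for _, lab in pairs[:i]]
        right = [lab for _, lab in pairs[i:]]
        remainder = (len(left) * entropy(left)
                     + len(right) * entropy(right)) / len(pairs)
        best = max(best, base - remainder)
    return best

def evaluate(tree, row):
    """Evaluate an expression tree on one row.

    A tree is either an int (index of an original attribute)
    or a tuple (op, left_subtree, right_subtree).
    """
    if isinstance(tree, int):
        return row[tree]
    op, left, right = tree
    return OPS[op](evaluate(left, row), evaluate(right, row))

def fitness(tree, rows, labels):
    """Fitness of a candidate feature: information gain of its values."""
    return information_gain([evaluate(tree, r) for r in rows], labels)

# Toy data (hypothetical): the class is 1 when the two attributes share a sign.
rows = [(1, 1), (-1, -1), (1, -1), (-1, 1), (2, 3), (-2, -1), (2, -3), (-1, 2)]
labels = [1 if a * b > 0 else 0 for a, b in rows]

product_feature = ("*", 0, 1)  # nonlinear combination x0 * x1
print("x0      :", fitness(0, rows, labels))
print("x1      :", fitness(1, rows, labels))
print("x0 * x1 :", fitness(product_feature, rows, labels))
```

On this toy data neither original attribute separates the classes with a single threshold, while the constructed product feature does, so it receives the higher fitness. A GP run would search over many such trees and keep the best-scoring ones for the downstream classifier.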

Index Terms:
Feature construction, genetic programming, classification.
Citation:
Mohammed Muharram, George D. Smith, "Evolutionary Constructive Induction," IEEE Transactions on Knowledge and Data Engineering, vol. 17, no. 11, pp. 1518-1528, Nov. 2005, doi:10.1109/TKDE.2005.182