10th Euromicro Workshop on Parallel, Distributed and Network-based Processing (EUROMICRO-PDP 2002) (2002)
Canary Islands, Spain
Jan. 9, 2002 to Jan. 11, 2002
G. Folino , ISI-CNR
C. Pizzuti , ISI-CNR
G. Spezzano , ISI-CNR
A parallel genetic programming approach to induce decision trees in large data sets is presented. A population of trees is evolved by employing the genetic operators and every individual is evaluated by using a fitness function based on the J-measure. The method is able to deal with large data sets since it uses a parallel implementation of genetic programming through the grid model. Experiments on data sets from the UCI machine learning repository show better results with respect to C5. Furthermore, performance results show a nearly linear speedup.
Decision Trees, Genetic programming, classification, parallel processing
G. Folino, C. Pizzuti and G. Spezzano, "Improving Induction Decision Trees with Parallel Genetic Programming," 10th Euromicro Workshop on Parallel, Distributed and Network-based Processing (EUROMICRO-PDP 2002)(PDP), Canary Islands, Spain, 2002, pp. 0181.