Third IEEE International Conference on Data Mining (ICDM'03)
Comparing Pure Parallel Ensemble Creation Techniques Against Bagging
Melbourne, Florida
November 19-November 22
ISBN: 0-7695-1978-4
We experimentally evaluate randomization-based approaches to creating an ensemble of decision-tree classifiers. Unlike methods related to boosting, all of the eight approaches considered here create each classifier in an ensemble independently of the other classifiers. Experiments were performed on 28 publicly available datasets, using C4.5 release 8 as the base classifier. While each of the other seven approaches has some strengths, we find that none of them is consistently more accurate than standard bagging when tested for statistical significance.
Citation:
Lawrence O. Hall, Kevin W. Bowyer, Robert E. Banfield, Divya Bhadoria, W. Philip Kegelmeyer, Steven Eschrich, "Comparing Pure Parallel Ensemble Creation Techniques Against Bagging," icdm, pp.533, Third IEEE International Conference on Data Mining (ICDM'03), 2003