The Community for Technology Leaders
Green Image
We apply an analytical framework for the analysis of linearly combined classifiers to ensembles generated by bagging. This provides an analytical model of bagging misclassification probability as a function of the ensemble size, which is a novel result in the literature. Experimental results on real data sets confirm the theoretical predictions. This allows us to derive a novel and theoretically grounded guideline for choosing bagging ensemble size. Furthermore, our results are consistent with explanations of bagging in terms of classifier instability and variance reduction, support the optimality of the simple average over the weighted average combining rule for ensembles generated by bagging, and apply to other randomization-based methods for constructing classifier ensembles. Although our results do not allow to compare bagging misclassification probability with the one of an individual classifier trained on the \textit{original} training set, we discuss how the considered theoretical framework could be exploited to this aim.
Multiple Classifier Systems, Bagging, Linear Combiners, Classifier Fusion, pattern classification.
Serrau Alessandra, Roli Fabio, Giorgio Fumera, "A Theoretical Analysis of Bagging as a Linear Combination of Classifiers", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 30, no. , pp. 1293-1299, July 2008, doi:10.1109/TPAMI.2008.30
109 ms
(Ver )