Third International Conference on Information Technology and Applications (ICITA'05) Volume 1 A Decomposition Scheme Based on Error-Correcting Output Codes for Ensembles of Text Categorisers Sydney, Australia July 04-July 07 ISBN: 0-7695-2316-1
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICITA.2005.9
Error-Correcting Output Codes (ECOC) are commonly used to decompose a multi-category problem into many dichotomies. Therefore, the text categorisation task is performed by an ensemble of binary classifiers instead of a single monolithic classifier. The ensemble performance largely depends on the characteristics of the decomposition. We propose a decomposition approach where both the categories and the classifiers are well separated in order to maximise the decision boundaries and minimise correlated predictions. We apply this design to the El Mundo corpus (newspaper articles in Spanish) and the well-known ModApt?e split of the Reuters-21578 corpus. The results using ensembles are favourably compared to those using a monolithic classifier.
Citation:
Juan José García Adeva, Rafael A. Calvo, "A Decomposition Scheme Based on Error-Correcting Output Codes for Ensembles of Text Categorisers," icita, vol. 1, pp.375-378, Third International Conference on Information Technology and Applications (ICITA'05) Volume 1, 2005 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||