loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Third International Conference on Information Technology and Applications (ICITA'05) Volume 1
A Decomposition Scheme Based on Error-Correcting Output Codes for Ensembles of Text Categorisers
Sydney, Australia
July 04-July 07
ISBN: 0-7695-2316-1
Juan José García Adeva, University of Sydney
Rafael A. Calvo, University of Sydney
Error-Correcting Output Codes (ECOC) are commonly used to decompose a multi-category problem into many dichotomies. Therefore, the text categorisation task is performed by an ensemble of binary classifiers instead of a single monolithic classifier. The ensemble performance largely depends on the characteristics of the decomposition. We propose a decomposition approach where both the categories and the classifiers are well separated in order to maximise the decision boundaries and minimise correlated predictions. We apply this design to the El Mundo corpus (newspaper articles in Spanish) and the well-known ModApt?e split of the Reuters-21578 corpus. The results using ensembles are favourably compared to those using a monolithic classifier.
Citation:
Juan José García Adeva, Rafael A. Calvo, "A Decomposition Scheme Based on Error-Correcting Output Codes for Ensembles of Text Categorisers," icita, vol. 1, pp.375-378, Third International Conference on Information Technology and Applications (ICITA'05) Volume 1, 2005
Usage of this product signifies your acceptance of the Terms of Use.