loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Seventh IEEE Symposium on Computers and Communications (ISCC'02)
Combining Homogeneous Classifiers for Centroid-based Text Classification
Ramada Hotel, Taormina-Giardini Naxos, Italy
July 01-July 04
ISBN: 0-7695-1671-8
Verayuth Lertnattee, Thammasart University
Thanaruk Theeramunkong, Thammasart University
Centroid-based text classification is one of the most popular supervised approaches to classify texts into a set of pre-defined classes. Based on the vector-space model, the performance of this classification particularly depends on the way to weight and select important terms in documents for constructing a prototype class vector for each class. In the past, it was shown that term weighting using statistical term distributions could improve classification accuracy. However, for different data sets, the best weighting systems are different. Towards this problem, we propose a method that uses homogenous centroid-based classification. The effectiveness of this approach is explored using four data sets. Two main factors are taken into account: model selection and score combination. By experiments, the results show that our system can improve classification accuracy up to 7.5-8. 5% comparing to k-NN classifier, 3.7-4.0% comparing with na?ve Bayes classifier and 1.6-2.7% over the best single-model classification method (p\lt;0.05).
Citation:
Verayuth Lertnattee, Thanaruk Theeramunkong, "Combining Homogeneous Classifiers for Centroid-based Text Classification," iscc, pp.1034, Seventh IEEE Symposium on Computers and Communications (ISCC'02), 2002
Usage of this product signifies your acceptance of the Terms of Use.