loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
22nd International Conference on Advanced Information Networking and Applications - Workshops (aina workshops 2008)
A New Centroid-Based Classifier for Text Categorization
March 25-March 28
ISBN: 978-0-7695-3096-3
In recent years, centroid-based document classifiers receive wide interests from text mining community because of their simplicity and linear-time complexity. However, the traditional centroid-based classifiers ususally perform less effectively for Chinese text categorization. In this paper, we tackle the problem by developping a new way to calculate the class-specific weights for each term in the training phase; in the testing phase, the new documents are assigned to the centroid to which the document is most similar based on the weighted distance measurement. The experimental results demonstrate that the accuracy of our algorithm outperforms the traditional centroid-based classifiers, as well as outstanding efficiency compared with the Support Vector Machine (SVM) based classifers for Chinese text categorization.
Index Terms:
text categorization, centroid-based classifer, term weighting, class-specific weighting
Citation:
Lifei Chen, Yanfang Ye, Qingshan Jiang, "A New Centroid-Based Classifier for Text Categorization," ainaw, pp.1217-1222, 22nd International Conference on Advanced Information Networking and Applications - Workshops (aina workshops 2008), 2008
Usage of this product signifies your acceptance of the Terms of Use.