This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2009 International Conference on Artificial Intelligence and Computational Intelligence
A SVM-Based Text Classification Method with SSK-Means Clustering Algorithm
Shanghai, China
November 07-November 08
ISBN: 978-0-7695-3816-7
SVM-based classification needs lots of labeled data to train classifier model, but labeling training dataset is a time-wasting and energy-wasting task. Furthermore, the feature space is sparse commonly because of text’s high dimension. All of the factors above can influence the performance of classification. We propose a SVM-based text classification with SSK-means clustering algorithm where little labeled training data are needed. In this approach, training data, including both labeled and unlabeled data, are first clustered with guidance of the labeled data. The unlabeled data samples are then labeled based on the clusters obtained. SVM classifiers can be trained with the expanded training dataset. When the training dataset has only a little labeled data, this method has better performance than SVM classifiers.
Index Terms:
SVM classification, SSK-means clustering algorithm, labeled data
Citation:
Hongcan Yan, Chen Lin, Bicheng Li, "A SVM-Based Text Classification Method with SSK-Means Clustering Algorithm," aici, vol. 2, pp.379-383, 2009 International Conference on Artificial Intelligence and Computational Intelligence, 2009
Usage of this product signifies your acceptance of the Terms of Use.