|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2009 International Conference on Artificial Intelligence and Computational Intelligence
A SVM-Based Text Classification Method with SSK-Means Clustering Algorithm
Shanghai, China
November 07-November 08
ISBN: 978-0-7695-3816-7
| ASCII Text | x | ||
| Hongcan Yan, Chen Lin, Bicheng Li, "A SVM-Based Text Classification Method with SSK-Means Clustering Algorithm," Artificial Intelligence and Computational Intelligence, International Conference on, vol. 2, pp. 379-383, 2009 International Conference on Artificial Intelligence and Computational Intelligence, 2009. | |||
| BibTex | x | ||
| @article{ 10.1109/AICI.2009.446, author = {Hongcan Yan and Chen Lin and Bicheng Li}, title = {A SVM-Based Text Classification Method with SSK-Means Clustering Algorithm}, journal ={Artificial Intelligence and Computational Intelligence, International Conference on}, volume = {2}, year = {2009}, isbn = {978-0-7695-3816-7}, pages = {379-383}, doi = {http://doi.ieeecomputersociety.org/10.1109/AICI.2009.446}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Artificial Intelligence and Computational Intelligence, International Conference on TI - A SVM-Based Text Classification Method with SSK-Means Clustering Algorithm SN - 978-0-7695-3816-7 SP379 EP383 A1 - Hongcan Yan, A1 - Chen Lin, A1 - Bicheng Li, PY - 2009 KW - SVM classification KW - SSK-means clustering algorithm KW - labeled data VL - 2 JA - Artificial Intelligence and Computational Intelligence, International Conference on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/AICI.2009.446
SVM-based classification needs lots of labeled data to train classifier model, but labeling training dataset is a time-wasting and energy-wasting task. Furthermore, the feature space is sparse commonly because of text’s high dimension. All of the factors above can influence the performance of classification. We propose a SVM-based text classification with SSK-means clustering algorithm where little labeled training data are needed. In this approach, training data, including both labeled and unlabeled data, are first clustered with guidance of the labeled data. The unlabeled data samples are then labeled based on the clusters obtained. SVM classifiers can be trained with the expanded training dataset. When the training dataset has only a little labeled data, this method has better performance than SVM classifiers.
Index Terms:
SVM classification, SSK-means clustering algorithm, labeled data
Citation:
Hongcan Yan, Chen Lin, Bicheng Li, "A SVM-Based Text Classification Method with SSK-Means Clustering Algorithm," aici, vol. 2, pp.379-383, 2009 International Conference on Artificial Intelligence and Computational Intelligence, 2009
Usage of this product signifies your acceptance of the Terms of Use.
