|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
Fifth IEEE International Conference on Data Mining (ICDM'05)
Efficient Text Classification by Weighted Proximal SVM
Houston, Texas
November 27-November 30
ISBN: 0-7695-2278-5
| ASCII Text | x | ||
| Dong Zhuang, Benyu Zhang, Qiang Yang, Jun Yan, Zheng Chen, Ying Chen, "Efficient Text Classification by Weighted Proximal SVM," Data Mining, IEEE International Conference on, pp. 538-545, Fifth IEEE International Conference on Data Mining (ICDM'05), 2005. | |||
| BibTex | x | ||
| @article{ 10.1109/ICDM.2005.56, author = {Dong Zhuang and Benyu Zhang and Qiang Yang and Jun Yan and Zheng Chen and Ying Chen}, title = {Efficient Text Classification by Weighted Proximal SVM}, journal ={Data Mining, IEEE International Conference on}, volume = {0}, year = {2005}, issn = {1550-4786}, pages = {538-545}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICDM.2005.56}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Data Mining, IEEE International Conference on TI - Efficient Text Classification by Weighted Proximal SVM SN - 1550-4786 SP538 EP545 A1 - Dong Zhuang, A1 - Benyu Zhang, A1 - Qiang Yang, A1 - Jun Yan, A1 - Zheng Chen, A1 - Ying Chen, PY - 2005 KW - null VL - 0 JA - Data Mining, IEEE International Conference on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDM.2005.56
In this paper, we present an algorithm that can classify large-scale text data with high classification quality and fast training speed. Our method is based on a novel extension of the proximal SVM mode [3]. Previous studies on proximal SVM have focused on classification for low dimensional data and did not consider the unbalanced data cases. Such methods will meet difficulties when classifying unbalanced and high dimensional data sets such as text documents. In this work, we extend the original proximal SVM by learning a weight for each training error. We show that the classification algorithm based on this model is capable of handling high dimensional and unbalanced data. In the experiments, we compare our method with the original proximal SVM (as a special case of our algorithm) and the standard SVM (such as SVM light) on the recently published RCV1-v2 dataset. The results show that our proposed method had comparable classification quality with the standard SVM. At the same time, both the time and memory consumption of our method are less than that of the standard SVM.
Citation:
Dong Zhuang, Benyu Zhang, Qiang Yang, Jun Yan, Zheng Chen, Ying Chen, "Efficient Text Classification by Weighted Proximal SVM," icdm, pp.538-545, Fifth IEEE International Conference on Data Mining (ICDM'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.
