Los Angeles, CA
March 31, 2009 to April 2, 2009
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/CSIE.2009.22
For resolving the low classification efficiency to domain-specific documents for traditional text categorization algorithms (like KNN, SVM and etc), this paper presents a new text classifier with high performance oriented domain-specific documents. The algorithm is mainly depended on the weight and weight factor of hierarchy feature words in documents. Different hierarchy feature words which have different weights are collected by processing to corpus and classification tree. The weight factor is gained by machine learning method to corpus and knowledge base. Classification experiment to metrological documents shows that new classifier outperforms the KNN.
text classification; machne learning; KNN
Zengmin Geng, Jujian Zhang, Xuefei Li, Jianxia Du, "Research on Domain-Specific Text Classifier", CSIE, 2009, 2009 WRI World Congress on Computer Science and Information Engineering, CSIE, 2009 WRI World Congress on Computer Science and Information Engineering, CSIE 2009, pp. 559-563, doi:10.1109/CSIE.2009.22