2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05) Categorical Term Descriptor: A Proposed Term Weighting Scheme for Feature Selection Compi?gne University of Technology, France September 19-September 22 ISBN: 0-7695-2415-X
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/WI.2005.46
This paper proposes a term weighing scheme, Categorical Term Descriptor (CTD), for feature selection in automated text categorization. CTD is an adatation of the Term Frequency Inverse Document Frequency (TFIDF). We compared the performance of the proposed method against classical methods such as Correlation Coefficient, Chi-Square and Information Gain using the Multinomial Naïve Bayes and the Support Vector Machine (SVM) classifiers on the Reuters [10] and Reuters [115] variants of Reuters-21578 dataset. Despite its simplicity, CTD has proven to be promising for both local and global feature selection CTD works best for the Reuters [10] as a stable local FS method.
Citation:
Bong Chih How, Narayanan Kulathuramaiyer, Wong Ting Kiong, "Categorical Term Descriptor: A Proposed Term Weighting Scheme for Feature Selection," wi, pp.313-316, 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05), 2005 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||