loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05)
Categorical Term Descriptor: A Proposed Term Weighting Scheme for Feature Selection
Compi?gne University of Technology, France
September 19-September 22
ISBN: 0-7695-2415-X
Bong Chih How, Universiti Malaysia Sarawak
Narayanan Kulathuramaiyer, Universiti Malaysia Sarawak
Wong Ting Kiong, Universiti Malaysia Sarawak
This paper proposes a term weighing scheme, Categorical Term Descriptor (CTD), for feature selection in automated text categorization. CTD is an adatation of the Term Frequency Inverse Document Frequency (TFIDF). We compared the performance of the proposed method against classical methods such as Correlation Coefficient, Chi-Square and Information Gain using the Multinomial Naïve Bayes and the Support Vector Machine (SVM) classifiers on the Reuters [10] and Reuters [115] variants of Reuters-21578 dataset. Despite its simplicity, CTD has proven to be promising for both local and global feature selection CTD works best for the Reuters [10] as a stable local FS method.
Citation:
Bong Chih How, Narayanan Kulathuramaiyer, Wong Ting Kiong, "Categorical Term Descriptor: A Proposed Term Weighting Scheme for Feature Selection," wi, pp.313-316, 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.