Fourth International Conference on Hybrid Intelligent Systems (HIS'04)
An Efficient Feature Selection Using Multi-Criteria in Text Categorization
Kitakyushu, Japan
December 05-December 08
ISBN: 0-7695-2291-2
Text categorization is a problem of assigning a document into one or more predefined classes. One of the most interesting issues in text categorization is feature selection. This paper proposes a novel approach in feature selection based on multi-criteria ranking of features. Based on a threshold value for each criterion, a new procedure for feature selection is proposed and applied to a text categorization. Experiments dealing with the Reuters-21578 benchmark data and the naive Bayes algorithm show that the proposed approach outperforms performances in compare to conventional feature selection methods.