22nd International Conference on Advanced Information Networking and Applications - Workshops (aina workshops 2008) An Efficient Feature Selection Using Hidden Topic in Text Categorization March 25-March 28 ISBN: 978-0-7695-3096-3
Text categorization is an important research area in information retrieval. In order to save the storage space and get better accuracy, efficient and effective feature selection methods for reducing the data before analysis are highly desired. Usually, researches on feature selection use only a proper measurement such as information gain. In this paper, we propose a new feature selection method by adopting an attractive hidden topic analysis and entropy-based feature ranking. Experiments dealing with the well-known Reuters-21578 and Ohsumed datasets show that our method can achieve a better classification accuracy while reducing the feature dimension dramatically.
Index Terms:
feature selection
Citation:
Zhiwei Zhang, Xuan-Hieu Phan, Susumu Horiguchi, "An Efficient Feature Selection Using Hidden Topic in Text Categorization," ainaw, pp.1223-1228, 22nd International Conference on Advanced Information Networking and Applications - Workshops (aina workshops 2008), 2008 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||