loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05)
Integrating Compound Terms in Bayesian Text Classification
Compi?gne University of Technology, France
September 19-September 22
ISBN: 0-7695-2415-X
Jing Bai, Universit? de Montr?al
Jian-Yun Nie, Universit? de Montr?al
Guihong Cao, Universit? de Montr?al
Text classification usually assumed a word-based document representation. In this paper, we propose a new approach to integrate compound terms in Bayesian text classification. Compound terms are used as complementary features to single words. An acute problem is to consider their dependence with the component words. In this paper, we propose to use smoothing techniques to combine both compound term and word representations. Experiments have been conducted on two corpora. Our results show that this approach can slightly but steadily improve the classification performance on both test corpora.
Citation:
Jing Bai, Jian-Yun Nie, Guihong Cao, "Integrating Compound Terms in Bayesian Text Classification," wi, pp.598-601, 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.