2007 Seventh IEEE International Conference on Data Mining
Improving Text Classification by Using Encyclopedia Knowledge
Omaha, Nebraska, USA
October 28-October 31
ISBN: 0-7695-3018-4
The exponential growth of text documents available on the Internet has created an urgent need for accurate, fast, and general-purpose text classification algorithms. However, the "bag of words" representation used by these classification methods is often unsatisfactory because it ignores relationships between important terms that do not co-occur literally. To deal with this problem, we integrate background knowledge (in our application, Wikipedia) into the process of classifying text documents. Experimental evaluation on Reuters newsfeeds and several other corpora shows that our classification results with encyclopedia knowledge are much better than those of the baseline "bag of words" methods.
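The limitation described in the abstract can be illustrated with a small sketch. It is not the authors' method: the term-to-concept map `TOY_CONCEPTS`, the helper names, and the example sentences are all hypothetical stand-ins for knowledge that would, in the paper's setting, come from Wikipedia. The sketch only shows that two documents with related but non-overlapping terms have zero cosine similarity under a plain bag of words, and a nonzero similarity once shared concept features are added.

```python
import math
from collections import Counter

# Hypothetical term -> concept map standing in for encyclopedia knowledge.
# These entries are illustrative assumptions, not data from the paper.
TOY_CONCEPTS = {
    "puma": ["big_cat"],
    "cougar": ["big_cat"],
    "stocks": ["finance"],
    "equities": ["finance"],
}


def bag_of_words(text):
    """Count lowercase, whitespace-separated tokens."""
    return Counter(text.lower().split())


def enrich(bow, concepts=TOY_CONCEPTS):
    """Return a copy of the bag of words with concept features added."""
    enriched = Counter(bow)
    for term, count in bow.items():
        for concept in concepts.get(term, []):
            enriched["concept:" + concept] += count
    return enriched


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Two documents about the same animal that share no literal token.
doc_a = bag_of_words("puma sighted near town")
doc_b = bag_of_words("cougar spotted outside village")

plain_similarity = cosine(doc_a, doc_b)
enriched_similarity = cosine(enrich(doc_a), enrich(doc_b))

print(plain_similarity, enriched_similarity)
```

In this toy setup the plain representation sees the two documents as unrelated, while the enriched one links them through the shared `concept:big_cat` feature. How concepts are actually extracted from Wikipedia and weighted is described in the paper itself and is not reproduced here.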
Citation:
Pu Wang, Jian Hu, Hua-Jun Zeng, Lijun Chen, Zheng Chen, "Improving Text Classification by Using Encyclopedia Knowledge," 2007 Seventh IEEE International Conference on Data Mining (ICDM), pp. 332-341, 2007