loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
17th International Conference on Pattern Recognition (ICPR'04) - Volume 2
Applying the Conjugate Gradient Method for Text Document Categorization
Cambridge UK
August 23-August 26
ISBN: 0-7695-2128-2
Vincent Tam, The University of Hong Kong, Pokfulam, Hong Kong
Rudy Setiono, The National University of Singapore
A. Santoso, The National University of Singapore
In this paper, we investigate the effectiveness of two different methods to solve the linear least squares fit (LLSF) problem for document categorization. The first method is the Singular Value Decomposition (SVD) method that has been previously used to solve the document categorization problem. The second method is the Conjugate Gradient (CG) method that is one of the most effective algorithms for solving a linear equation problem. However, up to our knowledge, the CG method has never been applied to handle the document classification problem. Therefore, we compare the effectiveness of these two LLSF methods to categorize text documents. In addition, we examine the effect of using different term weighting schemes on their performance for document classification. Lastly, we compare the performance of the LLSF classifiers agaisnt the neighborhood-based Dt-kNN classifier, our best variant of the kNN classifier integrated with a dynamic threshold scheme, on the Reuters 21578 dataset. Besides being the first proposal to use the CG method for document classification, our work opens up many exciting directions for future investigation.
Index Terms:
Document Classification, Linear Least Squares Fit, Conjugate Gradient Method, Performance Measures
Citation:
Vincent Tam, Rudy Setiono, A. Santoso, "Applying the Conjugate Gradient Method for Text Document Categorization," icpr, vol. 2, pp.558-561, 17th International Conference on Pattern Recognition (ICPR'04) - Volume 2, 2004
Usage of this product signifies your acceptance of the Terms of Use.