loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Fourth International Conference on Web Information Systems Engineering (WISE'03)
Enhancing Text Classification Using Synopses Extraction
Roma, Italy
December 10-December 12
ISBN: 0-7695-1999-7
Liping Ma, University of New South Wales
John Shepherd, University of New South Wales
Yanchun Zhang, Victoria University of Technology
This paper describes a novel approach to document classification that uses decision-tree machine learning based on a succinct vector of important terms in each document. The succinct vector itself is generated by a machine-learning approach which builds parsers that can identify significant features in a document by partitioning it into regions based on low-level document characteristics. The fact that the feature vector is succinct overcomes the problem of very large term vectors, which have hindered the application of conventional machine learning to document classification. The fact that the parser can be trained to extract only important terms from documents means that small training sets can be used to achieve the same classification accuracy as with conventional approaches.
Citation:
Liping Ma, John Shepherd, Yanchun Zhang, "Enhancing Text Classification Using Synopses Extraction," wise, pp.115, Fourth International Conference on Web Information Systems Engineering (WISE'03), 2003
Usage of this product signifies your acceptance of the Terms of Use.