loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI'06)
Binary Cybergenre Classification Using Theoretic Feature Measures
Hong Kong, China
December 18-December 22
ISBN: 0-7695-2747-7
Lei Dong, Dalhousie University, Canada
Carolyn Watters, Dalhousie University, Canada
Jack Duffy, Dalhousie University, Canada
Michael Shepherd, Dalhousie University, Canada
In this study, we conducted an investigation on automatic genre classification for three common types of web pages addressing the effect of three theoretic feature selection measures, a range of feature set size, and three machine classifiers on the accuracy of the web page classification in the context of a set of controlled experiments. Our results are encouraging and we conclude that for binary classification tasks, at least for these web page genres, it is possible to reach satisfying results with small content-based feature sets generated with a sound feature selection measure and furthermore there is no evidence of interaction between these feature selection measures and the machine classifiers used.
Citation:
Lei Dong, Carolyn Watters, Jack Duffy, Michael Shepherd, "Binary Cybergenre Classification Using Theoretic Feature Measures," wi, pp.313-316, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.