loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
15th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'03)
Classification of Web Documents Using a Naive Bayes Method
Sacramento, California, USA
November 03-November 05
ISBN: 0-7695-2038-3
Yong Wang, Mississippi State University
Julia Hodges, Mississippi State University
Bo Tang, Mississippi State University
This paper presents an automatic document classification system, WebDoc, which classifies Web documents according to the Library of Congress classification scheme. WebDoc constructs a knowledge base from the training data and then classifies the documents based on information in the knowledge base. One of the classification algorithms used in WebDoc is based on Bayes? theorem from probability theory. This paper focuses upon three aspects of this approach: different event models for the naive Bayes method, different probability smoothing methods, and different feature selection methods. In this paper, we report the performance of each method in terms of recall, precision, and F-measures. Experimental results show that the WebDoc system can classify Web documents effectively and efficiently.
Citation:
Yong Wang, Julia Hodges, Bo Tang, "Classification of Web Documents Using a Naive Bayes Method," ictai, pp.560, 15th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'03), 2003
Usage of this product signifies your acceptance of the Terms of Use.