loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Third International Conference on Web Information Systems Engineering (Workshops) - (WISEw'02)
Two Odds-Radio-Based Text Classification Algorithms
Singapore
December 11-December 11
ISBN: 0-7695-1813-3
Zhi-Hong Deng, Peking University
Shi-Wei Tang, Peking University
Dong-Qing Yang, Peking University
Ming Zhang, Peking University
Xiao-Bin Wu, Peking University
Meng Yang, Peking University
Since 1990?s, the exponential growth of these Web documents has led to a great deal of interest in developing efficient tools and software to assist users in finding relevant information. Text classification has been proved to be useful in helping organize and search text information on the Web. Although there have been existed a number of text classification algorithms, most of them are either inefficient or too complex. In this paper we present two Odds-Radio-Based text classification algorithms, which are called OR and TF*OR respectively. We have evaluated our algorithm on two text collections and compared it against k-NN and SVM. Experimental results show that OR and TF*OR are competitive with k-NN and SVM. Furthermore, OR and TF*OR is much simpler and faster than them. The results also indicate that it is not TF but relevance factors derived from Odds Radio that play the decisive role in document categorization.
Citation:
Zhi-Hong Deng, Shi-Wei Tang, Dong-Qing Yang, Ming Zhang, Xiao-Bin Wu, Meng Yang, "Two Odds-Radio-Based Text Classification Algorithms," wisew, pp.223, Third International Conference on Web Information Systems Engineering (Workshops) - (WISEw'02), 2002
Usage of this product signifies your acceptance of the Terms of Use.