This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Sixth International Conference on Parallel and Distributed Computing Applications and Technologies (PDCAT'05)
Web Document Classification Based on Extended Rough set
Dalian, China
December 05-December 08
ISBN: 0-7695-2405-2
Gaoxiang Yi, Huazhong University of Science and technology,Wuhan,Hubei, China
Heping Hu, Huazhong University of Science and technology,Wuhan,Hubei, China
Zhengding Lu, Huazhong University of Science and technology,Wuhan,Hubei, China
A VSM algorithm for Web document classification based on an extended rough set --Tolerance Rough Set is proposed. Firstly, Web document are denoted by vector space model with terms. Then the value of term co-occurrence is made used of description of tolerance class of term, which extends the capability of term to document. Finally, Web document classification algorithm is implemented, in which the similarity between documents is described by term tolerance class. Experiments using data sets collected from two Web portals: Yahoo and Open Directory Project are conducted.
Citation:
Gaoxiang Yi, Heping Hu, Zhengding Lu, "Web Document Classification Based on Extended Rough set," pdcat, pp.916-919, Sixth International Conference on Parallel and Distributed Computing Applications and Technologies (PDCAT'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.