loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2003 IEEE/WIC International Conference on Web Intelligence (WI'03)
Querying and Clustering Web Pages about Persons and Organizations
Halifax, Canada
October 13-October 17
ISBN: 0-7695-1932-6
Shiren Ye, National University of Singapore
Tat-seng Chua, National University of Singapore
Jeremy R. Kei, National University of Singapore
One of the most frequent Web surfing tasks is to search for names of persons and organizations. Such names are often not distinctive, commonly occurring, and nonunique. Thus, a single name may be mapped to several entities. The paper describes a methodology to cluster the Web pages returned by the search engine so that pages belonging to different entities are clustered into different groups. The algorithm uses a combination of named entities, link-based and structure-based information as features to partition the document set into direct and indirect pages using a decision model. It then uses the distinct direct pages as seeds to cluster the document set into different clusters. The algorithm has been found to be effective for Web-based applications.
Citation:
Shiren Ye, Tat-seng Chua, Jeremy R. Kei, "Querying and Clustering Web Pages about Persons and Organizations," wi, pp.344, 2003 IEEE/WIC International Conference on Web Intelligence (WI'03), 2003
Usage of this product signifies your acceptance of the Terms of Use.