2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05)
Metadata Propagation in the Web Using Co-Citations
Compiègne University of Technology, France
September 19-September 22
ISBN: 0-7695-2415-X
Camille Prime-Claverie, École Nationale Supérieure des Mines
Michel Beigbeder, École Nationale Supérieure des Mines
Thierry Lafouge, Université Claude Bernard Lyon 1
Given the large heterogeneity of the World Wide Web, using metadata on the search engine side seems to be a useful track for information retrieval. However, because manual qualification at Web scale is not feasible, this track is rarely pursued. We propose a semi-automatic method for propagating metadata. In a first step, homogeneous corpora are extracted. In our study we used the following properties: the authority type, the site type, the information type, and the page type. This first step is carried out by a clustering that uses a similarity measure based on the co-citation frequency between pages. Given the cluster hierarchy, the second step selects a reduced number of documents to be manually qualified and propagates the given metadata values to the other documents belonging to the same cluster. A qualitative evaluation and a preliminary study of the scalability of this method are presented.
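The two steps described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not give the exact similarity measure or clustering algorithm, so the sketch assumes a Jaccard-style normalisation of the co-citation frequency and a single cut of a single-link hierarchy at a fixed threshold. The function names, the toy link graph, the choice of the most-cited page as the document to qualify, and the `page_type` values are all hypothetical.

```python
from itertools import combinations


def cocitation_similarity(links):
    """links maps a citing page to the set of pages it links to.

    Assumption: similarity = (pages citing both) / (pages citing either).
    """
    cited_by = {}
    for source, targets in links.items():
        for target in targets:
            cited_by.setdefault(target, set()).add(source)
    sim = {}
    for a, b in combinations(sorted(cited_by), 2):
        common = len(cited_by[a] & cited_by[b])  # co-citation frequency
        if common:
            sim[(a, b)] = common / len(cited_by[a] | cited_by[b])
    return sim, cited_by


def cluster(pages, sim, threshold):
    """Group pages connected by similarities >= threshold (union-find).

    This equals cutting a single-link hierarchy at one level.
    """
    parent = {p: p for p in pages}

    def find(p):
        while parent[p] != p:
            parent[p] = parent[parent[p]]  # path compression
            p = parent[p]
        return p

    for (a, b), s in sim.items():
        if s >= threshold:
            parent[find(a)] = find(b)
    groups = {}
    for p in pages:
        groups.setdefault(find(p), []).append(p)
    return [sorted(g) for g in groups.values()]


def pick_representatives(clusters, cited_by):
    """Hypothetical selection rule: the most-cited page of each cluster."""
    return [min(c, key=lambda p: (-len(cited_by[p]), p)) for c in clusters]


def propagate(clusters, manual_labels):
    """Copy the metadata of a manually qualified page to its whole cluster."""
    labels = {}
    for members in clusters:
        qualified = [p for p in members if p in manual_labels]
        if not qualified:
            continue  # nothing to propagate in this cluster
        values = manual_labels[qualified[0]]
        for p in members:
            labels[p] = dict(values)
    return labels


# Toy link graph (hypothetical): four citing pages, four cited pages.
links = {
    "hub1": {"a", "b"},
    "hub2": {"a", "b"},
    "hub3": {"c", "d"},
    "hub4": {"b", "c", "d"},
}
sim, cited_by = cocitation_similarity(links)
clusters = cluster(sorted(cited_by), sim, threshold=0.5)
to_qualify = pick_representatives(clusters, cited_by)
manual = {"b": {"page_type": "home page"}, "c": {"page_type": "link list"}}
propagated = propagate(clusters, manual)
```

With this toy graph, only two of the four cited pages need manual qualification, and the other two inherit their values. In a real setting the threshold would control the trade-off between the number of documents to qualify by hand and the homogeneity of each cluster.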
Citation:
Camille Prime-Claverie, Michel Beigbeder, Thierry Lafouge, "Metadata Propagation in the Web Using Co-Citations," 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05), pp. 602-605, 2005