loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
International Conference on Information Technology: Coding and Computing (ITCC'04) Volume 1
Exploring Similarity among Web Pages Using the Hyperlink Structure
Las Vegas, Nevada
April 05-April 07
ISBN: 0-7695-2108-8
Shou-Hsuan Stephen Huang, University of Houston
Carlos Humberto Molina-Rodr?guez, Universidad Aut?noma de Guadalajara
Jes?s Ubaldo Quevedo-Torrero, University of Houston
Mario Francisco Fonseca-Lozada, Universidad Aut?noma de Guadalajara
Hyperlinks inside HTML pages contain a wealth of information about the relationships among web pages. Given a set of web pages, we can explore the hyperlink relationships among these pages. This paper first provides formal definitions of hyperlink relations. We then use the notations to define similarity between two web pages and between two sets of web pages. For each one of them, we provide several definitions of similarity using forward and backward links. The similarity measure gives us a number between 0 and 1. We also demonstrate how to use the similarity measure to study clustering within a set of pages and to determine the "diversity" of a set of web pages.
Citation:
Shou-Hsuan Stephen Huang, Carlos Humberto Molina-Rodr?guez, Jes?s Ubaldo Quevedo-Torrero, Mario Francisco Fonseca-Lozada, "Exploring Similarity among Web Pages Using the Hyperlink Structure," itcc, vol. 1, pp.344, International Conference on Information Technology: Coding and Computing (ITCC'04) Volume 1, 2004
Usage of this product signifies your acceptance of the Terms of Use.