loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications (RIDE-SDMA'05)
Using Probabilistic Latent Semantic Analysis for Web Page Grouping
Tokyo, Japan
April 03-April 04
ISBN: 0-7695-2390-0
Guandong Xu, Victoria University and Wenzhou University
Yanchun Zhang, Victoria University
Xiaofang Zhou, University of Queensland
The locality of web pages within a web site is initially determined by the designer?s expectation. Web usage mining can discover the patterns in the navigational behaviour of web visitors, in turn, improve web site functionality and service designing by considering users? actual opinion. Conventional web page clustering technique is often utilized to reveal the functional similarity of web pages. However, high-dimensional computation problem will be incurred due to taking user transaction as dimension. In this paper, we propose a new web page grouping approach based on Probabilistic Latent Semantic Analysis (PLSA) model. An iterative algorithm based on maximum likelihood principle is employed to overcome the aforementioned computational shortcoming. The web pages are classified into various groups according to user access patterns. Meanwhile, the semantic latent factors or tasks are characterized by extracting the content of "dominant" pages related to the factors. We demonstrate the effectiveness of our approach by conducting experiments on real world data sets.
Citation:
Guandong Xu, Yanchun Zhang, Xiaofang Zhou, "Using Probabilistic Latent Semantic Analysis for Web Page Grouping," ride, pp.29-36, 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications (RIDE-SDMA'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.