This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Personalizing Web Directories with the Aid of Web Usage Data
September 2010 (vol. 22 no. 9)
pp. 1331-1344
Dimitrios Pierrakos, NCSR "Demokritos'', Athens
Georgios Paliouras, NCSR "Demokritos", Athens
This paper presents a knowledge discovery framework for the construction of Community Web Directories, a concept that we introduced in our recent work, applying personalization to Web directories. In this context, the Web directory is viewed as a thematic hierarchy and personalization is realized by constructing user community models on the basis of usage data. In contrast to most of the work on Web usage mining, the usage data that are analyzed here correspond to user navigation throughout the Web, rather than a particular Web site, exhibiting as a result a high degree of thematic diversity. For modeling the user communities, we introduce a novel methodology that combines the users' browsing behavior with thematic information from the Web directories. Following this methodology, we enhance the clustering and probabilistic approaches presented in previous work and also present a new algorithm that combines these two approaches. The resulting community models take the form of Community Web Directories. The proposed personalization methodology is evaluated both on a specialized artificial and a general-purpose Web directory, indicating its potential value to the Web user. The experiments also assess the effectiveness of the different machine learning techniques on the task.

[1] B. Mobasher, R. Cooley, and J. Srivastava, "Automatic Personalization Based on Web Usage Mining," Comm. ACM, vol. 43, no. 8, pp. 142-151, 2000.
[2] J. Srivastava, R. Cooley, M. Deshpande, and P.T. Tan, "Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data," SIGKDD Explorations, vol. 1, no. 2, pp. 12-23, 2000.
[3] D. Pierrakos, G. Paliouras, C. Papatheodorou, and C.D. Spyropoulos, "Web Usage Mining as a Tool for Personalization: A Survey," User Modeling and User-Adapted Interaction, vol. 13, no. 4, pp. 311-372, 2003.
[4] G. Paliouras, C. Papatheodorou, V. Karkaletsis, and C.D. Spyropoulos, "Discovering User Communities on the Internet Using Unsupervised Machine Learning Techniques," Interacting with Computers J., vol. 14, no. 6, pp. 761-791, 2002.
[5] G. Xu, Y. Zhang, and Y. Xun, "Modeling User Behaviour for Web Recommendation Using lda Model," Proc. IEEE/WIC/ACM Int'l Conf. Web Intelligence and Intelligent Agent Technology, pp. 529-532, 2008.
[6] W. Chu and S.-T.P. Park, "Personalized Recommendation on Dynamic Content Using Predictive Bilinear Models," Proc. 18th Int'l Conf. World Wide Web (WWW), pp. 691-700, 2009.
[7] The Adaptive Web, Methods and Strategies of Web Personalization, P. Brusilovsky, A. Kobsa, and W. Neijdl, eds. Springer, 2007.
[8] D. Pierrakos, G. Paliouras, C. Papatheodorou, V. Karkaletsis, and M. Dikaiakos, "Web Community Directories: A New Approach to Web Personalization," Web Mining: From Web to Semantic Web, B. Berendt et al., eds., pp. 113-129, Springer, 2004.
[9] D. Pierrakos and G. Paliouras, "Exploiting Probabilistic Latent Information for the Construction of Community Web Directories," Proc. 10th Int'l Conf. User Modeling, L. Ardissono, P. Brna, and A. Mitrovic, eds., pp. 89-98, 2005.
[10] C. Christophi, D. Zeinalipour-Yazti, M.D. Dikaiakos, and G. Paliouras, "Automatically Annotating the ODP Web Taxonomy," Proc. 11th Panhellenic Conf. Informatics (PCI '07), 2007.
[11] P.I. Hofgesang, "Online Mining of Web Usage Data: An Overview," Web Mining Applications in E-Commerce and E-Services, pp. 1-24, Springer, 2009.
[12] G. Castellano, A.M. Fanelli, and M.A. Torsello, "Computational Intelligence Techniques for Web Personalization," Web Intelligence and Agent Systems, vol. 6, no. 3, pp. 253-272, 2008.
[13] T. Hofmann, "Learning What People (Don't) Want," Proc. 12th European Conf. in Machine Learning, pp. 214-225, 2001.
[14] X. Jin, Y. Zhou, and B. Mobasher, "Web Usage Mining Based on Probabilistic Latent Semantic Analysis," Proc. ACM SIGKDD, pp. 197-205, Aug. 2004.
[15] X. Jin, Y. Zhou, and B. Mobasher, "Task-Oriented Web User Modeling for Recommendation," Proc. 10th Int'l Conf. User Modeling, L. Ardissono, P. Brna, and A. Mitrovic, eds., pp. 109-118, 2005.
[16] D. Chen, D. Wang, and F. Yu, "A PLSA-Based Approach for Building User Profile and Implementing Personalized Recommendation," Proc. Joint Ninth Asia-Pacific Web Conf. (APWeb '07) and Eighth Int'l Conf. Web-Age Information Management (WAIM '07), pp. 606-613, 2007.
[17] B. Mehta and N. Wolfgang, "Unsupervised Strategies for Shilling Detection and Robust Collaborative Filtering," User Modeling and User-Adapted Interaction, vol. 19, nos. 1/2, pp. 65-97, 2009.
[18] P.A. Chirita, W. Nejdl, R. Paiu, and C. Kohlschütter, "Using odp Metadata to Personalize Search," Proc. 28th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 178-185, 2005.
[19] A. Sieg, B. Mobasher, and R. Burke, "Ontological User Profiles for Representing Context in Web Search," Proc. IEEE/WIC/ACM Int'l Conf. Web Intelligence and Intelligent Agent Technology—Workshops, pp. 91-94, 2007.
[20] Z. Ma, G. Pant, and O.R.L. Sheng, "Interest-Based Personalized Search," ACM Trans. Information Systems, vol. 25, no. 1,article no. 5, Feb. 2007.
[21] T. Oishi, K. Yoshiaki, M. Tsunenori, H. Ryuzo, F. Hiroshi, and M. Koshimura, "Personalized Search Using odp-Based User Profiles Created from User Bookmark," Proc. 10th Pacific Rim Int'l Conf. Artificial Intelligence, pp. 839-848, 2008.
[22] J. Garofalakis, T. Giannakoudi, and A. Vopi, "Personalized Web Search by Constructing Semantic Clusters of User Profiles," Proc. 12th Proc. Int'l Conf. Knowledge-Based Intelligent Information and Eng. Systems, pp. 238-247, 2008.
[23] C.R. Anderson and E. Horvitz, "Web Montage: A Dynamic Personalized Start Page," Proc. 11th Int'l Conf. World Wide Web, pp. 704-712, May 2002.
[24] B. Smyth and C. Cotter, "Personalized Adaptive Navigation for Mobile Portals," Proc. 15th European Conf. Artificial Intelligence, 2002.
[25] J. Chaffee and S. Gauch, "Personal Ontologies for Web Navigation," Proc. Ninth Int'l Conf. Information and Knowledge Management (CIKM '00), pp. 227-234, 2000.
[26] T. Dalamagas, P. Bouros, T. Galanis, M. Eirinaki, and T. Sellis, "Mining User Navigation Patterns for Personalizing Topic Directories," Proc. Ninth Ann. ACM Int'l Workshop Web Information and Data Management, pp. 81-88, 2007.
[27] G. Salton and M. McGill, Introduction to Modern Information Retrieval. McGraw-Hill, 1986.
[28] Y. Zhao and G. Karypis, "Evaluation of Hierarchical Clustering Algorithms for Document Datasets," Proc. ACM Int'l Conf. Information and Knowledge Management (CIKM), pp. 515-524, Nov. 2002,
[29] H.W. Press, S.A. Teukolsky, W.T. Vetterling, and B.P. Flannery, Numerical Recipes in C, second ed. Cambridge Univ. Press, 1992.
[30] C. Bron and J. Kerbosch, "Algorithm 457-Finding All Cliques of an Undirected Graph," Comm. ACM, vol. 16, no. 9, pp. 575-577, 1973.
[31] T. Hofmann, "Probabilistic Latent Semantic Analysis," Proc. 15th Conf. Uncertainty in Artificial Intelligence (UAI' 99), pp. 289-296, 1999.
[32] J.C. Bezdek, Pattern Recognition with Fuzzy Objective Function Algorithms. Kluwer Academic Publishers, 1981.
[33] J. Hartigan, Clustering Algorithms. John Wiley & Sons, 1975.
[34] Y. Zhao and G. Karypis, "Empirical and Theoretical Comparisons of Selected Criterion Functions for Document Clustering," Machine Learning, vol. 55, no. 3, pp. 311-331, 2004.
[35] J.S. Breese, D. Heckerman, and C. Kadie, "Empirical Analysis of Predictive Algorithms for Collaborative Filtering," Proc. 14th Conf. Uncertainty in Artificial Intelligence (UAI '98), pp. 43-52, 1998.

Index Terms:
Machine learning, Web mining, clustering, personalization.
Citation:
Dimitrios Pierrakos, Georgios Paliouras, "Personalizing Web Directories with the Aid of Web Usage Data," IEEE Transactions on Knowledge and Data Engineering, vol. 22, no. 9, pp. 1331-1344, Sept. 2010, doi:10.1109/TKDE.2009.173
Usage of this product signifies your acceptance of the Terms of Use.