This Article 
 Bibliographic References 
 Add to: 
Mining Ontology for Automatically Acquiring Web User Information Needs
April 2006 (vol. 18 no. 4)
pp. 554-568
It is not easy to obtain the right information from the Web for a particular Web user or a group of users due to the obstacle of automatically acquiring Web user profiles. The current techniques do not provide satisfactory structures for mining Web user profiles. This paper presents a novel approach for this problem. The objective of the approach is to automatically discover ontologies from data sets in order to build complete concept models for Web user information needs. It also proposes a method for capturing evolving patterns to refine discovered ontologies. In addition, the process of assessing relevance in ontology is established. This paper provides both theoretical and experimental evaluations for the approach. The experimental results show that all objectives we expect for the approach are achievable.

[1] R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval. Addison Wesley, 1999.
[2] G. Chang, M.J. Healey, J.A.M. McHugh, and J.T.L. Wang, Mining the World Wide Web: An Information Search Approach. Kluwer Academic, 2001.
[3] M.S. Chen, J. Park, and P.S. Yu, “Data Mining For Path Traversal Patterns In A Web Environment,” Proc. 16th Int'l Conf. Distributed Computing Systems, pp. 385-392, 1996.
[4] M. Craven, “Learning to Extract Symbolic Knowledge from the World Wide Web,” Artificial Intelligence, vol. 118, pp. 69-113, 2000.
[5] R. Feldman et. al., “Maximal Association Rules: A New Tool for Mining for Keyword Co-Occurrences in Document Collection,” Proc. Third ACM SIGKDD Int'l Conf. Knowledge Discovery, pp. 167-170, 1997.
[6] R. Feldman, I. Dagen, and H. Hirsh, “Mining Text Using Keywords Distributions,” J. Intelligent Information Systems, vol. 10, no. 3, pp. 281-300, 1998.
[7] M.N. Garofalakis, R. Rastogi, S. Seshadri, and K. Shim, “Data Mining and the Web: Past, Present and Future,” Proc. ACM CIKM Int'l Workshop Web Information and Data Management, pp. 43-47, 1999.
[8] D.A. Grossman and O. Frieder, Information Retrieval Algorithms and Heuristics. Kluwer Academic, 1998.
[9] N. Govert and G. Kazai, “Overview of the Initiative for Evaluation of XML Retrieval (INEX),” Proc. First Workshop Initiative for the Evaluation of XML Retrieval, pp. 1-17, 2002.
[10] J.W. Guan, D.A. Bell, and D.Y. Liu, “The Rough Set Approach to Association Rules,” Proc. Thrid IEEE Int'l Conf. Data Mining, pp. 529-532, 2003.
[11] J.D. Holt and S.M. Chung, “Multipass Algorithms for Mining Association Rules in Text Databases,” Knowledge and Information Systems, vol. 3, pp. 168-183, 2001.
[12] D.A. Hull and S. Roberston, “The TREC-8 Filtering Track Final Report,” Proc. Text Retrieval Conf. (TREC-8), 1999.
[13] R. Janicki, “Towards a Mereological System for Direct Products and Relations,” Proc. Third Int'l Conf. Rough Sets and Current Trends in Computing, pp. 113-122, 2002.
[14] K.S. Jones, “Information Retrieval and Artificial Intelligence,” Artificial Intelligence, vol. 114, nos. 1-2, pp. 257-281, 1999.
[15] R. Kruse, E. Schwecke, and J. Heinsoln, Uncertainty and Vagueness in Knowledge Based Systems. Springer-Verlag, 1991.
[16] K.C. Lee, J.S. Kim, N.H. Chung, and S.J. Kwon, “Fussy Cognitive Map Approach to Web-Mining Inference Amplification,” Expert System with Applications, vol. 22, pp. 197-211, 2002.
[17] X. Li and B. Liu, “Learning to Classify Texts Using Positive and Unlabeled Data,” Proc. Int'l Joint Conf. Artificial Intelligence, pp. 587-592, 2003.
[18] Y. Li, “Extended Random Sets for Knowledge Discovery in Information Systems,” Proc. Ninth Int'l Conf. Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing, pp. 524-532, 2003.
[19] Y. Li and Y.Y. Yao, “User Profile Model: A View from Artificial Intelligence,” Proc. Third Int'l Conf. Rough Sets and Current Trends in Computing, pp. 493-496, 2002.
[20] Y. Li, C. Zhang, and J.R. Swan, “An Information Filtering Model on the Web and Its Application in JobAgent,” Knowledge-Based Systems, vol. 13, no. 5, pp. 285-296, 2000.
[21] Y. Li and N. Zhong, “Ontology-Based Web Mining Model: Representations of User Profiles,” Proc. IEEE/WIC Int'l Conf. Web Intelligence, pp. 96-103, 2003.
[22] Y. Li and N. Zhong, “Interpretations of Association Rules by Granular Computing,” Proc. Third IEEE Int'l Conf. Data Mining, pp. 593-596, 2003.
[23] Y. Li and N. Zhong, “Capturing Evolving Patterns for Ontology-Based Web Mining,” Proc. IEEE/WIC/ACM Int'l Conf. Web Intelligence, pp. 256-263, 2004.
[24] Y. Li and N. Zhong, “Web Mining Model and Its Applications on Information Gathering,” Knowledge-Based Systems, vol. 17, pp. 207-217, 2004.
[25] T.Y. Lin, “Database Mining on Derived Attributes,” Proc. Third Int'l Conf. Rough Sets and Current Trends in Computing, pp. 14-32, 2002.
[26] B. Liu and K. Chang, SIGKDD Explorations, special issue on Web content mining, vol. 6, no. 2, pp. 1-4, 2004.
[27] Z.Y. Lu, Y.Y. Yao, and N. Zhong, “Web Log Mining,” Web Intelligence, pp. 174-194, 2003.
[28] S.M. Madria, S.S. Bhowmick, W.K. Ng, and E.-P. Lim, “Research Issues in Web Data Mining,” Proc. First Int'l Conf. Data Warehousing and Knowledge Discovery, pp. 303-312, 1999.
[29] A. Maedche, Ontology Learning for the Semantic Web. Kluwer Academic, 2003.
[30] A. Maedche, V. Pekar, and S. Staab, “Ontology Learning Part One— On Discovering Taxonomic Relations from the Web,” Web Intelligence, pp. 302-319, 2003.
[31] H. Mannila and H. Toivonen, “Discovering Generalized Episodes Using Minimal Occurrences,” Proc. Second Int'l Conf. Knowledge Discovery and Data Mining, pp. 146-151, 1996.
[32] C. Manning and H. Schuetze, Foundations of Statistical Natural Language Processing. MIT Press, 1999.
[33] J. Mostafa, W. Lam, and M. Palakal, “A Multilevel Approach to Intelligent Information Filtering: Model, System, and Evaluation,” ACM Trans. Information Systems, vol. 15, no. 4, pp. 368-399, 1997.
[34] H. Motoda, “Active Mining,” Frontiers in Artificial Intelligence and Application, no. 79, IOS Press, 2002.
[35] N.F. Noy and M.A. Musen, “PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment,” Proc. 15th Nat'l Conf. Artificial Intelligence, 2000.
[36] S.K. Pal and V. Talwar, “Web Mining in Soft Computing Framework: Relevance, State of the Art and Future Directions,” IEEE Trans. Neural Networks, vol. 13, no. 5, pp. 1163-1177, 2002.
[37] Z. Pawlak, “Rough Sets and Intelligent Data Analysis,” Int'l J. Information and Sciences, vol. 147, pp. 1-12, 2002.
[38] Z. Pawlak, “In Pursuit of Patterns in Data Reasoning from Data— The Rough Set Way,” Proc. Third Int'l Conf. Rough Sets and Current Trends in Computing, pp. 1-9, 2002.
[39] M. Perkowitz and O. Etzioni, “Adaptive Web Sites,” Comm. ACM, vol. 43, no. 8, pp. 152-158, 2002.
[40] S. Robertson and D.A. Hull, “The TREC-9 Filtering Track Final Report,” Proc. Text Retrieval Conf. (TREC-9), 2000.
[41] S. Schocken and R.A. Hummel, “On the Use of the Dempster Shafer Model in Information Indexing and Retrieval Applications,” Int'l J. Man-Machine Studies, vol. 39, pp. 843-879, 1993.
[42] F. Sebastiani, “Machine Learning in Automated Text Categorization,” ACM Computing Surveys, vol. 34, no. 1, pp. 1-47, 2002.
[43] J. Srivastava, R. Cooley, M. Deshpande, and P.-N. Tan, “Web Usage Mining: Discovery and Applications of Usage Pattern from Web Data,” SIGKDD Explorations, vol. 1, no. 2, pp. 12-23, 2002.
[44] S. Staab and R. Studer, Handbook on Ontologies. Springer, 2004.
[45] S. Tsumoto and S. Hirano, “Visualization of Rule's Similarity Using Multidimensional Scaling,” Proc. Third IEEE Int'l Conf. Data Mining, pp. 339-346, 2003.
[46] P. Tzvetkov, X. Yan, and J. Han, “TSP: Mining Top-K Closed Sequential Patterns,” Proc. Third IEEE Int'l Conf. Data Mining, pp. 347-354, 2003.
[47] S.-T. Wu, Y. Li, Y. Xu, B. Pham, and P. Chen, “Automatic Pattern Taxonomy Exatraction for Web Mining,” Proc. IEEE/WIC/ACM Int'l Conf. Web Intelligence, pp. 242-248, 2004.
[48] T.Y. Yan, M. Jacobsen, H. Garcia-Molina, and U. Dayal, “From User Access Patterns to Dynamic Hypertext Linking,” Proc. Fifth Int'l World Wide Web Conf., 1996.
[49] Q. Yang, H. Zhang, I. Tian, and Y. Li, “Mining Web Logs for Prediction Models in WWW Caching and Prefetching,” Proc. Seventh ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, pp. 473-478, 2001.
[50] N. Zhong, “Representation and Construction of Ontologies for Web Intelligence,” Int'l J. Foundation of Computer Science, vol. 13, no. 4, pp. 555-570, 2002.
[51] N. Zhong, J. Liu, and Y.Y. Yao, “In Search of the Wisdom Web,” Computer, vol. 35, no. 11, pp. 27-31, Nov. 2002.

Index Terms:
Web intelligence, ontology mining, Web mining, Web user profiles.
Yuefeng Li, Ning Zhong, "Mining Ontology for Automatically Acquiring Web User Information Needs," IEEE Transactions on Knowledge and Data Engineering, vol. 18, no. 4, pp. 554-568, April 2006, doi:10.1109/TKDE.2006.62
Usage of this product signifies your acceptance of the Terms of Use.