The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.09 - Sept. (2012 vol.24)
pp: 1671-1685
Aditya Telang , University of Texas at Arlington, Arlington
Chengkai Li , University of Texas at Arlington, Arlington
Sharma Chakravarthy , University of Texas at Arlington, Arlington
ABSTRACT
With the emergence of the deep web, searching web databases in domains such as vehicles, real estate, etc., has become a routine task. One of the problems in this context is ranking the results of a user query. Earlier approaches for addressing this problem have used frequencies of database values, query logs, and user profiles. A common thread in most of these approaches is that ranking is done in a user- and/or query-independent manner. This paper proposes a novel query- and user-dependent approach for ranking query results in web databases. We present a ranking model, based on two complementary notions of user and query similarity, to derive a ranking function for a given user query. This function is acquired from a sparse workload comprising of several such ranking functions derived for various user-query pairs. The model is based on the intuition that similar users display comparable ranking preferences over the results of similar queries. We define these similarities formally in alternative ways and discuss their effectiveness analytically and experimentally over two distinct web databases.
INDEX TERMS
Databases, Mathematical model, Vehicles, Context, Equations, Image color analysis, Color, workload, Automated ranking, web databases, user similarity, query similarity
CITATION
Aditya Telang, Chengkai Li, Sharma Chakravarthy, "One Size Does Not Fit All: Toward User- and Query-Dependent Ranking for Web Databases", IEEE Transactions on Knowledge & Data Engineering, vol.24, no. 9, pp. 1671-1685, Sept. 2012, doi:10.1109/TKDE.2011.36
REFERENCES
[1] S. Agrawal, S. Chaudhuri, G. Das, and A. Gionis, "Automated Ranking of Database Query Results," Proc. Conf. Innovative Data Systems Research (CIDR), 2003.
[2] S. Amer-Yahia, A. Galland, J. Stoyanovich, and C. Yu, "From del.icio.us to x.qui.site: Recommendations in Social Tagging Sites," Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 1323-1326, 2008.
[3] R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval. ACM Press, 1999.
[4] M. Balabanovic and Y. Shoham, "Content-Based Collaborative Recommendation," Comm. ACM, vol. 40, no. 3, pp. 66-72, 1997.
[5] J. Basilico and T. Hofmann, "A Joint Framework for Collaborative and Content Filtering," Proc. 27th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 550-551, 2004.
[6] C. Basu, H. Hirsh, and W.W. Cohen, "Recommendation as Classification: Using Social and Content-Based Information in Recommendation," Proc. 15th Nat'l Conf. Artificial Intelligence (AAAI/IAAI), pp. 714-720, 1998.
[7] M.K. Bergman, "The Deep Web: Surfacing Hidden Value," J. Electronic Publishing, vol. 7, no. 1, pp. 41-50, 2001.
[8] D. Billsus and M.J. Pazzani, "Learning Collaborative Information Filters," Proc. Int'l Conf. Machine Learning (ICML), pp. 46-54, 1998.
[9] K.C.-C. Chang, B. He, C. Li, M. Patil, and Z. Zhang, "Structured Databases on the Web: Observations and Implications," SIGMOD Record, vol. 33, no. 3, pp. 61-70, 2004.
[10] S. Chaudhuri, G. Das, V. Hristidis, and G. Weikum, "Probabilistic Ranking of Database Query Results," Proc. 30th Int'l Conf. Very Large Data Bases (VLDB), pp. 888-899, 2004.
[11] S. Chaudhuri, G. Das, V. Hristidis, and G. Weikum, "Probabilistic Information Retrieval Approach for Ranking of Database Query Results," ACM Trans. Database Systems, vol. 31, no. 3, pp. 1134-1168, 2006.
[12] C. Dwork, R. Kumar, M. Naor, and D. Sivakumar, "Rank Aggregation Methods for the Web," Proc. Int'l Conf. World Wide Web (WWW), pp. 613-622, 2001.
[13] N. Fuhr, "A Probabilistic Framework for Vague Queries and Imprecise Information in Databases," Proc. 16th Int'l Conf. Very Large Data Bases (VLDB), pp. 696-707, 1990.
[14] S. Gauch and M. Speretta, "User Profiles for Personalized Information Access," Adaptive Web, pp. 54-89, 2007.
[15] Google, Google Base, http://www.google.combase, 2012.
[16] B. He, "Relevance Feedback," Encyclopedia of Database Systems, pp. 2378-2379, Springer, 2009.
[17] T. Hofmann, "Collaborative Filtering via Gaussian Probabilistic Latent Semantic Analysis," Proc. 26th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 259-266, 2003.
[18] S.-W. Hwang, "Supporting Ranking For Data Retrieval," PhD thesis, Univ. of Illinois, Urbana Champaign, 2005.
[19] T. Kanungo and D. Mount, "An Efficient K-Means Clustering Algorithm: Analysis and Implementation," IEEE Trans. of Pattern Analysis in Machine Intelligence, vol. 24, no. 7, pp. 881-892, July 2002.
[20] G. Koutrika and Y.E. Ioannidis, "Constrained Optimalities in Query Personalization," Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 73-84, 2005.
[21] C. Li, K.C.-C. Chang, I.F. Ilyas, and S. Song, "Ranksql: Query Algebra and Optimization for Relational Top-k Queries," Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 131-142, 2005.
[22] X. Luo, X. Wei, and J. Zhang, "Guided Game-Based Learning Using Fuzzy Cognitive Maps," IEEE Trans. Learning Technologies, vol. 3, no. 4, pp. 344-357, Oct.-Dec. 2010.
[23] A. Marian, N. Bruno, and L. Gravano, "Evaluating Top-k Queries over Web-Accessible Databases," ACM Trans. Database Systems, vol. 29, no. 2, pp. 319-362, 2004.
[24] A. Penev and R.K. Wong, "Finding Similar Pages in a Social Tagging Repository," Proc. 17th Int'l Conf. World Wide Web (WWW), pp. 1091-1092, 2008.
[25] Y. Rui, T.S. Huang, and S. Mehrotra, "Content-Based Image Retrieval with Relevance Feedback in Mars," Proc. IEEE Int'l Conf. Image Processing, pp. 815-818, 1997.
[26] X. Shi and C.C. Yang, "Mining Related Queries from Web Search Engine Query Logs Using an Improved Association Rule Mining Model," J. Am. Soc. Information Science Technology, vol. 58, pp. 1871-1883, Oct. 2007.
[27] W. Su, J. Wang, Q. Huang, and F. Lochovsky, "Query Result Ranking over E-Commerce Web Databases," Proc. Conf. Information and Knowledge Management (CIKM), pp. 575-584, 2006.
[28] A. Telang, S. Chakravarthy, and C. Li, "Establishing a Workload for Ranking in Web Databases," technical report, UT Arlington, http://cse.uta.edu/research/Publications/ DownloadsCSE-2010-3.pdf, 2010.
[29] A. Telang, C. Li, and S. Chakravarthy, "One Size Does not Fit All: Towards User- and Query-Dependent Ranking for Web Databases," technical report, UT Arlington, http://cse.uta.edu/ research/Publications/ DownloadsCSE-2009-6.pdf, 2009.
[30] K. Werner, "Foundations of Preferences in Database Systems," Proc. 28th Int'l Conf. Very Large Data Bases (VLDB), pp. 311-322, 2002.
[31] L. Wu et al., "Falcon: Feedback Adaptive Loop for Content-Based Retrieval," Proc. Int'l Conf. Very Large Data Bases (VLDB), pp. 297-306, 2000.
[32] Z. Xu, X. Luo, and W. Lu, "Association Link Network: An Incremental Semantic Data Model on Organizing Web Resources," Proc. Int'l Conf. Parallel and Distributed Systems (ICPADS), pp. 793-798, 2010.
[33] H. Yu, S.-w. Hwang, and K.C.-C. Chang, "Enabling Soft Queries for Data Retrieval," Information Systems, vol. 32, no. 4, pp. 560-574, 2007.
[34] H. Yu, Y. Kim, and S. won Hwang, "Rv-svm: An Efficient Method for Learning Ranking Svm," Proc. Pacific-Asia Conf. Knowledge Discovery and Data Mining (PAKDD), pp. 426-438, 2009.
[35] T.C. Zhou, H. Ma, M.R. Lyu, and I. King, "Userrec: A User Recommendation Framework in Social Tagging Systems," Proc. 24th AAAI Conf. Artificial Intelligence, 2010.
10 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool