2008 IEEE 24th International Conference on Data Engineering (2008)
Cancun, Mexico
Apr. 7, 2008 to Apr. 12, 2008
ISBN: 978-1-4244-1836-7
pp: 228-236
Erik Vee, Yahoo! Research, Sunnyvale, CA, USA. erikvee@yahoo-inc.com
Utkarsh Srivastava, Yahoo! Research, Sunnyvale, CA, USA. utkarsh@yahoo-inc.com
Jayavel Shanmugasundaram, Yahoo! Research, Sunnyvale, CA, USA. jaishan@yahoo-inc.com
Prashant Bhat, Yahoo! Research, Sunnyvale, CA, USA. pbhat@yahoo-inc.com
Sihem Amer Yahia, Yahoo! Research, Sunnyvale, CA, USA. sihem@yahoo-inc.com
ABSTRACT
We study the problem of efficiently computing diverse query results in online shopping applications, where users specify queries through a form interface that allows a mix of structured and content-based selection conditions. Intuitively, the goal of diverse query answering is to return a representative set of top-k answers from all the tuples that satisfy the user's selection condition. For example, if a user is searching for Honda cars and we can display only five results, we wish to return cars from five different Honda models, as opposed to returning cars from only one or two Honda models. A key contribution of this paper is to formally define the notion of diversity and to show that existing score-based techniques commonly used in web applications are not sufficient to guarantee diversity. Another contribution is to develop novel and efficient query processing techniques that guarantee diversity. Our experimental results using Yahoo! Autos data show that our proposed techniques are scalable and efficient.
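The Honda example in the abstract can be illustrated with a small sketch. This is not the paper's algorithm or its formal definition of diversity, neither of which is given in the abstract. It is a simple round-robin selection over one attribute, shown only to make the intuition concrete. The function name `diverse_top_k`, the `key` parameter, and the sample listings are all hypothetical.

```python
from collections import OrderedDict


def diverse_top_k(rows, k, key):
    """Return up to k rows, cycling over distinct key values (round-robin).

    Illustrative assumption only: diversity here means "cover as many
    distinct values of one attribute as possible before repeating any".
    """
    # Group rows by attribute value, preserving first-seen order.
    groups = OrderedDict()
    for row in rows:
        groups.setdefault(key(row), []).append(row)

    queues = list(groups.values())
    result = []
    depth = 0  # how many rows have already been taken from each group
    while len(result) < k and any(depth < len(q) for q in queues):
        for q in queues:
            if depth < len(q):
                result.append(q[depth])
                if len(result) == k:
                    break
        depth += 1
    return result


# Hypothetical listings: (model, listing id), all matching "Honda".
cars = [
    ("Civic", 1), ("Civic", 2), ("Civic", 3),
    ("Accord", 4), ("Accord", 5),
    ("CR-V", 6), ("Odyssey", 7), ("Pilot", 8), ("Fit", 9),
]
picked = diverse_top_k(cars, 5, key=lambda car: car[0])
models = {model for model, _ in picked}  # five distinct models
```

Taking the first five rows of `cars` directly would instead yield three Civics and two Accords, which is the kind of result the abstract describes as undesirable.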
CITATION

U. Srivastava, P. Bhat, S. A. Yahia, J. Shanmugasundaram and E. Vee, "Efficient Computation of Diverse Query Results," 2008 IEEE 24th International Conference on Data Engineering (ICDE), Cancun, Mexico, 2008, pp. 228-236.
doi:10.1109/ICDE.2008.4497431