2008 IEEE 24th International Conference on Data Engineering (2008)
Apr. 7, 2008 to Apr. 12, 2008
Erik Vee, Yahoo! Research, Sunnyvale, CA, USA
Utkarsh Srivastava, Yahoo! Research, Sunnyvale, CA, USA
Jayavel Shanmugasundaram, Yahoo! Research, Sunnyvale, CA, USA
Prashant Bhat, Yahoo! Research, Sunnyvale, CA, USA
Sihem Amer-Yahia, Yahoo! Research, Sunnyvale, CA, USA
We study the problem of efficiently computing diverse query results in online shopping applications, where users specify queries through a form interface that allows a mix of structured and content-based selection conditions. Intuitively, the goal of diverse query answering is to return a representative set of top-k answers from all the tuples that satisfy the user's selection conditions. For example, if a user is searching for Honda cars and we can display only five results, we wish to return cars from five different Honda models rather than cars from only one or two models. A key contribution of this paper is to formally define the notion of diversity and to show that the score-based techniques commonly used in web applications are not sufficient to guarantee it. Another contribution is to develop novel and efficient query processing techniques that do guarantee diversity. Our experimental results on Yahoo! Autos data show that the proposed techniques are scalable and efficient.
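The Honda example can be made concrete with a small sketch. The abstract does not give the paper's formal definition of diversity or its query processing algorithms, so the code below is not the authors' method. It is a naive round-robin selection over one attribute, and the function name `diverse_top_k` and the `model` field are hypothetical names chosen for illustration.

```python
from collections import defaultdict
from itertools import zip_longest


def diverse_top_k(cars, k, key="model"):
    """Pick up to k cars, covering as many distinct `key` values as possible.

    Illustrative only: a naive round-robin over one attribute, not the
    paper's algorithm. `cars` is a list of dicts that all contain `key`.
    """
    # Group the matching tuples by the attribute we want to diversify on.
    groups = defaultdict(list)
    for car in cars:
        groups[car[key]].append(car)

    picked = []
    # Take one car per group, then a second per group, and so on,
    # stopping as soon as k results have been collected.
    for round_ in zip_longest(*groups.values()):
        for car in round_:
            if car is not None:
                picked.append(car)
                if len(picked) == k:
                    return picked
    return picked


cars = [
    {"id": 1, "model": "Civic"},
    {"id": 2, "model": "Civic"},
    {"id": 3, "model": "Civic"},
    {"id": 4, "model": "Accord"},
    {"id": 5, "model": "CR-V"},
]
top3 = diverse_top_k(cars, 3)
print([c["model"] for c in top3])
```

With three slots, this selection returns one car from each of the three models, whereas simply taking the first three rows would return three Civics.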
U. Srivastava, P. Bhat, S. A. Yahia, J. Shanmugasundaram and E. Vee, "Efficient Computation of Diverse Query Results," 2008 IEEE 24th International Conference on Data Engineering (ICDE), Cancun, Mexico, 2008, pp. 228-236.