Issue No. 05 - May (2014 vol. 26)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2013.44
Marina Drosou, Comput. Sci. Dept., Univ. of Ioannina, Ioannina, Greece
Evaggelia Pitoura, Comput. Sci. Dept., Univ. of Ioannina, Ioannina, Greece
Result diversification has recently attracted considerable attention as a means of increasing user satisfaction in recommender systems, as well as in web and database search. In this paper, we focus on the problem of selecting the k most diverse items from a result set. Whereas previous research has mainly considered the static version of the problem, we explore the dynamic case in which the result set changes over time, as happens, for example, in notification services. We define the CONTINUOUS k-DIVERSITY PROBLEM along with appropriate constraints that enforce continuity requirements on the diversified results. Our proposed approach is based on cover trees and supports dynamic item insertion and deletion. The diversification problem is in general NP-hard; we provide theoretical bounds that characterize the quality of our cover tree solution with respect to the optimal one. Since results are often associated with a relevance score, we extend our approach to account for relevance. Finally, we report experimental results concerning the efficiency and effectiveness of our approach on a variety of real and synthetic datasets.
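To make the selection problem concrete, the following is a minimal Python sketch of a static greedy baseline, not the cover-tree method of the paper. It assumes that diversity means keeping the selected items far apart under a distance function (one common formulation; the abstract does not state which is used), and it does not handle insertions, deletions, or relevance scores. The function name `greedy_diverse_subset`, the Euclidean default distance, and the sample points are all illustrative assumptions.

```python
import math


def greedy_diverse_subset(items, k, dist=math.dist):
    """Farthest-first greedy heuristic (illustrative, static data only).

    Starts from an arbitrary seed (the first item) and repeatedly adds the
    item whose distance to its nearest already-selected item is largest.
    """
    n = len(items)
    if k <= 0 or n == 0:
        return []
    chosen = [0]  # indices of selected items; index 0 is an arbitrary seed
    # min_dist[i]: distance from items[i] to its nearest selected item,
    # or -1.0 once items[i] has itself been selected.
    min_dist = [dist(p, items[0]) for p in items]
    min_dist[0] = -1.0
    while len(chosen) < min(k, n):
        idx = max(range(n), key=min_dist.__getitem__)
        chosen.append(idx)
        min_dist = [
            -1.0 if d < 0 else min(d, dist(p, items[idx]))
            for d, p in zip(min_dist, items)
        ]
        min_dist[idx] = -1.0
    return [items[i] for i in chosen]


# Hypothetical 2-D result set with two tight clusters and one outlier.
points = [(0, 0), (0.1, 0), (5, 5), (5, 5.1), (10, 0)]
result = greedy_diverse_subset(points, 3)
print(result)
```

With this sample input the sketch picks one point from each region rather than two near-duplicates. Recomputing such a selection from scratch on every update is what a dynamic index, such as the cover tree described in the abstract, is meant to avoid.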
Keywords: tree data structures, computational complexity, database management systems, human factors, information filtering, Internet, optimisation, recommender systems
M. Drosou and E. Pitoura, "Diverse Set Selection Over Dynamic Data," in IEEE Transactions on Knowledge & Data Engineering, vol. 26, no. 5, pp. 1102-1116, 2014.