The Community for Technology Leaders
RSS Icon
Issue No.07 - July (2010 vol.22)
pp: 1041-1055
Sze Man Yuen , Chinese University of Hong Kong, Hong Kong
Yufei Tao , Chinese University of Hong Kong, Hong Kong
Xiaokui Xiao , Nanyang Technological University
Jian Pei , Simon Fraser University, Burnaby
Donghui Zhang , Northeastern University, Boston
This paper proposes a new problem, called superseding nearest neighbor search, on uncertain spatial databases, where each object is described by a multidimensional probability density function. Given a query point q, an object is a nearest neighbor (NN) candidate if it has a nonzero probability to be the NN of q. Given two NN-candidates o_1 and o_2, o_1 supersedes o_2 if o_1 is more likely to be closer to q. An object is a superseding nearest neighbor (SNN) of q, if it supersedes all the other NN-candidates. Sometimes no object is able to supersede every other NN-candidate. In this case, we return the SNN-core—the minimum set of NN-candidates each of which supersedes all the NN-candidates outside the SNN-core. Intuitively, the SNN-core contains the best objects, because any object outside the SNN-core is worse than all the objects in the SNN-core. We show that the SNN-core can be efficiently computed by utilizing a conventional multidimensional index, as confirmed by extensive experiments.
Nearest neighbor, uncertain, spatial database.
Sze Man Yuen, Yufei Tao, Xiaokui Xiao, Jian Pei, Donghui Zhang, "Superseding Nearest Neighbor Search on Uncertain Spatial Databases", IEEE Transactions on Knowledge & Data Engineering, vol.22, no. 7, pp. 1041-1055, July 2010, doi:10.1109/TKDE.2009.137
[1] N. Beckmann, H. Kriegel, R. Schneider, and B. Seeger, "The R$^\ast$ -Tree: An Efficient and Robust Access Method for Points and Rectangles," Proc. ACM SIGMOD, pp. 322-331, 1990.
[2] G. Beskales, M.A. Soliman, and I.F. Ilyas, "Efficient Search for the Top-k Probable Nearest Neighbors in Uncertain Databases," Proc. Very Large Data Bases (VLDB), vol. 1, no. 1, pp. 326-339, 2008.
[3] C. Bohm, A. Pryakhin, and M. Schubert, "The Gauss-Tree: Efficient Object Identification in Databases of Probabilistic Feature Vectors," Proc. Int'l Conf. Data Eng. (ICDE), 2006.
[4] R. Cheng, J. Chen, M.F. Mokbel, and C.-Y. Chow, "Probabilistic Verifiers: Evaluating Constrained Nearest-Neighbor Queries over Uncertain Data," Proc. Int'l Conf. Data Eng. (ICDE), pp. 973-982, 2008.
[5] R. Cheng, D.V. Kalashnikov, and S. Prabhakar, "Querying Imprecise Data in Moving Object Environments," IEEE Trans. Knowledge and Data Eng., vol. 16, no. 9, pp. 1112-1127, Sept. 2004.
[6] T.H. Cormen, C.E. Leiserson, R.L. Rivest, and C. Stein, Introduction to Algorithms. MIT Press, 2001.
[7] X. Dai, M.L. Yiu, N. Mamoulis, Y. Tao, and M. Vaitis, "Probabilistic Spatial Queries on Existentially Uncertain Data," Proc. Symp. Advances in Spatial and Temporal Databases (SSTD), pp. 400-417, 2005.
[8] G.R. Hjaltason and H. Samet, "Distance Browsing in Spatial Databases," ACM Trans. Database Systems, vol. 24, no. 2, pp. 265-318, 1999.
[9] M. Hua, J. Pei, W. Zhang, and X. Lin, "Ranking Queries on Uncertain Data: A Probabilistic Threshold Approach," Proc. ACM SIGMOD, pp. 673-686, 2008.
[10] H.V. Jagadish, B.C. Ooi, K.-L. Tan, C. Yu, and R. Zhang, "idistance: An Adaptive ${\rm b}^+$ -Tree Based Indexing Method for Nearest Neighbor Search," ACM Trans. Database Systems, vol. 30, no. 2, pp. 364-397, 2005.
[11] F. Korn and S. Muthukrishnan, "Influence Sets Based on Reverse Nearest Neighbor Queries," Proc. ACM SIGMOD, pp. 201-212, 2000.
[12] H.-P. Kriegel, P. Kunath, and M. Renz, "Probabilistic Nearest-Neighbor Query on Uncertain Objects," Proc. Database Systems for Advanced Applications (DASFAA), pp. 337-348, 2007.
[13] D. Papadias, Y. Tao, K. Mouratidis, and C.K. Hui, "Aggregate Nearest Neighbor Queries in Spatial Databases," ACM Trans. Database Systems, vol. 30, no. 2, pp. 529-576, 2005.
[14] J. Pei, B. Jiang, X. Lin, and Y. Yuan, "Probabilistic Skylines on Uncertain Data," Proc. Very Large Data Bases (VLDB), pp. 15-26, 2007.
[15] N. Roussopoulos, S. Kelley, and F. Vincent, "Nearest Neighbor Queries," Proc. ACM SIGMOD, pp. 71-79, 1995.
[16] T. Seidl and H.-P. Kriegel, "Optimal Multi-Step k-Nearest Neighbor Search," Proc. ACM SIGMOD, vol. 27, no. 2, pp. 154-165, 1998.
[17] M.A. Soliman, I.F. Ilyas, and K.C.-C. Chang, "Top-k Query Processing in Uncertain Databases," Proc. Int'l Conf. Data Eng. (ICDE), pp. 896-905, 2007.
[18] Y. Tao, D. Papadias, and Q. Shen, "Continuous Nearest Neighbor Search," Proc. Very Large Data Bases (VLDB), pp. 287-298, 2002.
[19] R. Weber, H.-J. Schek, and S. Blott, "A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces," Proc. Very Large Data Bases (VLDB), pp. 194-205, 1998.
[20] K. Yi, F. Li, G. Kollios, and D. Srivastava, "Efficient Processing of Top-k Queries in Uncertain Databases," Proc. Int'l Conf. Data Eng. (ICDE), pp. 1406-1408, 2008.
296 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool