Probabilistic Reverse Nearest Neighbor Queries on Uncertain Data
April 2010 (vol. 22 no. 4)
pp. 550-564
Muhammad Aamir Cheema, University of New South Wales, Sydney
Xuemin Lin, University of New South Wales, Sydney and NICTA
Wei Wang, University of New South Wales, Sydney and NICTA
Wenjie Zhang, University of New South Wales, Sydney and NICTA
Jian Pei, Simon Fraser University, Burnaby
Uncertain data are inherent in many important applications, and the reverse nearest neighbor (RNN) query is an important query type in many of them. Although many types of queries have been studied on uncertain data, there is no previous work on answering RNN queries on uncertain data. In this paper, we formalize the probabilistic reverse nearest neighbor query, which retrieves the objects from the uncertain data whose probability of being the RNN of an uncertain query object is higher than a given threshold. We develop an efficient algorithm, based on several novel pruning approaches, that answers probabilistic RNN queries on multidimensional uncertain data. The experimental results demonstrate that our algorithm is highly scalable and, in most cases, even more efficient than a sampling-based approximate algorithm.
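
To make the query definition concrete, the following is a minimal brute-force sketch in Python. It is not the pruning-based algorithm of the paper. It assumes that each uncertain object is a small discrete set of weighted instances, that objects are independent, and that an object is an RNN of the query in a given possible world when the query instance is strictly closer to it than any other object's instance. The function name `prnn_bruteforce` and the data layout are illustrative assumptions.

```python
from itertools import product
from math import dist, prod


def prnn_bruteforce(query, objects, threshold):
    """Brute-force probabilistic RNN under the assumptions stated above.

    query:     list of (point, probability) instances of the query object
    objects:   dict mapping object name -> list of (point, probability)
    threshold: keep objects whose RNN probability is strictly higher
    """
    names = list(objects)
    rnn_prob = {name: 0.0 for name in names}

    # A possible world picks exactly one instance for the query
    # and one instance for every data object.
    for world in product(query, *(objects[n] for n in names)):
        (q_pt, q_p), rest = world[0], world[1:]
        world_prob = q_p * prod(p for _, p in rest)

        for i, (u_pt, _) in enumerate(rest):
            d_q = dist(u_pt, q_pt)
            # The object is an RNN of the query in this world if the query
            # is strictly closer to it than every other data object.
            if all(d_q < dist(u_pt, v_pt)
                   for j, (v_pt, _) in enumerate(rest) if j != i):
                rnn_prob[names[i]] += world_prob

    return {n: p for n, p in rnn_prob.items() if p > threshold}


# Hypothetical toy data: a certain query and two uncertain objects.
query = [((0.0, 0.0), 1.0)]
objects = {
    "A": [((1.0, 0.0), 0.5), ((5.0, 0.0), 0.5)],
    "B": [((6.0, 0.0), 1.0)],
}
result = prnn_bruteforce(query, objects, threshold=0.3)
```

The number of possible worlds grows exponentially with the number of objects, so this enumeration is only practical for toy inputs; that cost is what motivates pruning and sampling in practice.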

Index Terms:
Query processing, reverse nearest neighbor queries, uncertain data, spatial data.
Citation:
Muhammad Aamir Cheema, Xuemin Lin, Wei Wang, Wenjie Zhang, Jian Pei, "Probabilistic Reverse Nearest Neighbor Queries on Uncertain Data," IEEE Transactions on Knowledge and Data Engineering, vol. 22, no. 4, pp. 550-564, April 2010, doi:10.1109/TKDE.2009.108