Issue No.04 - April (2010 vol.22)

pp: 550-564

Muhammad Aamir Cheema , University of New South Wales, Sydney

Xuemin Lin , University of New South Wales, Sydney and NICTA

Wei Wang , University of New South Wales, Sydney and NICTA

Wenjie Zhang , University of New South Wales, Sydney and NICTA

Jian Pei , Simon Fraser University, Burnaby

DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2009.108

ABSTRACT

Uncertain data are inherent in various important applications and reverse nearest neighbor (RNN) query is an important query type for many applications. While many different types of queries have been studied on uncertain data, there is no previous work on answering RNN queries on uncertain data. In this paper, we formalize probabilistic reverse nearest neighbor query that is to retrieve the objects from the uncertain data that have higher probability than a given threshold to be the RNN of an uncertain query object. We develop an efficient algorithm based on various novel pruning approaches that solves the probabilistic RNN queries on multidimensional uncertain data. The experimental results demonstrate that our algorithm is even more efficient than a sampling-based approximate algorithm for most of the cases and is highly scalable.
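To make the query definition concrete, the following is a minimal brute-force sketch, not the pruning-based algorithm the abstract refers to. It assumes a discrete model in which each uncertain object is a list of `(point, probability)` instances whose probabilities sum to 1 and objects are independent; it also assumes ties in distance do not disqualify the query object. The names `rnn_probability` and `probabilistic_rnn` are hypothetical and chosen only for illustration.

```python
from math import dist, prod


def rnn_probability(candidate, query, others):
    """Probability that the query object is the nearest neighbor of `candidate`.

    Every argument is built from uncertain objects, each a list of
    (point, probability) instances (an assumed discrete model).
    """
    total = 0.0
    for u, pu in candidate:
        for q, pq in query:
            d = dist(u, q)
            # For each other object, the probability mass of its instances
            # that are NOT strictly closer to u than q is. Objects are
            # assumed independent, so the per-object masses multiply.
            not_closer = prod(
                sum(pv for v, pv in obj if dist(u, v) >= d) for obj in others
            )
            total += pu * pq * not_closer
    return total


def probabilistic_rnn(objects, query, threshold):
    """Return {name: probability} for objects whose RNN probability exceeds `threshold`."""
    result = {}
    for name, instances in objects.items():
        others = [obj for other, obj in objects.items() if other != name]
        p = rnn_probability(instances, query, others)
        if p > threshold:
            result[name] = p
    return result


if __name__ == "__main__":
    query = [((0.0, 0.0), 1.0)]
    objects = {
        "A": [((1.0, 0.0), 0.5), ((5.0, 0.0), 0.5)],
        "B": [((3.0, 0.0), 1.0)],
    }
    print(probabilistic_rnn(objects, query, threshold=0.4))
```

In this toy data set, object A qualifies only through its instance at (1, 0), and B never qualifies because both instances of A are closer to B than the query is. The cost of this enumeration grows with the product of instance counts, which is the kind of work a pruning-based approach aims to avoid.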

INDEX TERMS

Query processing, reverse nearest neighbor queries, uncertain data, spatial data.

CITATION

Muhammad Aamir Cheema, Xuemin Lin, Wei Wang, Wenjie Zhang, Jian Pei, "Probabilistic Reverse Nearest Neighbor Queries on Uncertain Data",

*IEEE Transactions on Knowledge & Data Engineering*, vol. 22, no. 4, pp. 550-564, April 2010, doi:10.1109/TKDE.2009.108