The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.07 - July (2012 vol.24)
pp: 1244-1258
Xuemin Lin , University of New South Wales, Sydney
Yufei Tao , Chinese University of Hong Kong, Hong Kong
Wenjie Zhang , University of New South Wales, Sydney
Haixun Wang , Microsoft Research, Beijing
ABSTRACT
In many applications, including location-based services, queries may not be precise. In this paper, we study the problem of efficiently computing range aggregates in a multidimensional space when the query location is uncertain. Specifically, for a query point Q whose location is uncertain and a set S of points in a multidimensional space, we want to calculate the aggregate (e.g., count, average and sum) over the subset S^{\prime } of S such that for each p \in S^{\prime }, Q has at least probability \theta within the distance \gamma to p. We propose novel, efficient techniques to solve the problem following the filtering-and-verification paradigm. In particular, two novel filtering techniques are proposed to effectively and efficiently remove data points from verification. Our comprehensive experiments based on both real and synthetic data demonstrate the efficiency and scalability of our techniques.
INDEX TERMS
Uncertainty, index, range aggregate query.
CITATION
Xuemin Lin, Yufei Tao, Wenjie Zhang, Haixun Wang, "Efficient Computation of Range Aggregates against Uncertain Location-Based Queries", IEEE Transactions on Knowledge & Data Engineering, vol.24, no. 7, pp. 1244-1258, July 2012, doi:10.1109/TKDE.2011.46
REFERENCES
[1] P.K. Agarwal, S.-W. Cheng, Y. Tao, and K. Yi, "Indexing Uncertain Data," Proc. Symp. Principles of Database Systems (PODS), 2009.
[2] C. Aggarwal and P. Yu, "On High Dimensional Indexing of Uncertain Data," Proc. IEEE 24th Int'l Conf. Data Eng. (ICDE), 2008.
[3] C. Bohm, M. Gruber, P. Kunath, A. Pryakhin, and M. Schubert, "Prover: Probabilistic Video Retrieval using the Gauss-Tree," Proc. IEEE 23rd Int'l Conf. Data Eng. (ICDE), 2007.
[4] C. Bohm, A. Pryakhin, and M. Schubert, "Probabilistic Ranking Queries on Gaussians," Proc. 18th Int'l Conf. Scientific and Statistical Database Management (SSDBM), 2006.
[5] V. Bryant, Metric Spaces: Iteration and Application. Cambridge Univ. Press, 1996.
[6] J. Chen and R. Cheng, "Efficient Evaluation of Imprecise Location-Dependent Queries," Proc. IEEE 23rd Int'l Conf. Data Eng. (ICDE), 2007.
[7] R. Cheng, J. Chen, M.F. Mokbel, and C.-Y. Chow, "Probabilistic Verifiers: Evaluating Constrained Nearest-neighbor Queries over Uncertain Data," Proc. IEEE Int'l Conf. Data Eng. (ICDE), 2008.
[8] R. Cheng, D.V. Kalashnikov, and S. Prabhakar, "Evaluating Probabilistic Queries over Imprecise Data," Proc. ACM SIGMOD Int'l Conf. Management of Data, 2003.
[9] R. Cheng, S. Singh, and S. Prabhakar, "Efficient Join Processing over Uncertain Data," Proc. Int'l Conf. Information and Knowledge Management (CIKM), 2006.
[10] R. Cheng, Y. Xia, S. Prabhakar, R. Shah, and J.S. Vitter, "Effcient Indexing Methods for Probabilistic Threshold Queries over Uncertain Data," Proc. Int'l Conf. Very Large Data Bases (VLDB), 2004.
[11] G.W. Cordner, "Police Patrol Work Load Studies: A Review and Critique," Police Studies, vol. 2, no. 3, pp. 50-60, 1979.
[12] X. Dai, M. Yiu, N. Mamoulis, Y. Tao, and M. Vaitis, "Probabilistic Spatial Queries on Existentially Uncertain Data," Proc. Int'l Symp. Large Spatio-Temporal Databases (SSTD), 2005.
[13] E. Frentzos, K. Gratsias, and Y. Theodoridis, "On the Effect of Location Uncertainty in Spatial Querying," IEEE Trans. Knowledge Data Eng., vol. 21, no. 3, pp. 366-383, Mar. 2009.
[14] M. Hua, J. Pei, W. Zhang, and X. Lin, "Ranking Queries on Uncertain Data: A Probabilistic Threshold Approach," Proc. ACM SIGMOD Int'l Conf. Management of Data, 2008.
[15] Y. Ishikawa, Y. Iijima, and J.X. Yu, "Spatial Range Querying for Gaussian-Based Imprecise Query Objects," Proc. IEEE 25th Int'l Conf. Data Eng. (ICDE), 2009.
[16] H.-P. Kriegel, P. Kunath, M. Pfeifle, and M. Renz, "Probabilistic Similarity Join on Uncertain Data," Proc. Int'l Conf. Database Systems for Advanced Applications (DASFAA), 2006.
[17] H.P. Kriegel and M. Pfeifle, "Density-Based Clustering of Uncertain Data," Proc. 11th ACM SIGKDD Int'l Conf. Knowledge Discovery in Data Mining (KDD), 2005.
[18] X. Lian and L. Chen, "Monochromatic and Bichromatic Reverse Skyline Search over Uncertain Databases," Proc. ACM SIGMOD Int'l Conf. Management of Data, 2008.
[19] R. Meester, A Natural Introduction to Probability Theory. Addison Wesley, 2004.
[20] W.K. Ngai, B. Kao, C.K. Chui, R. Cheng, M. Chau, and K.Y. Yip, "Efficient Clustering of Uncertain Data," Proc. Int'l Conf. Data Mining (ICDM), 2006.
[21] J. Ni, C.V. Ravishankar, and B. Bhanu, "Probabilistic Spatial Database Operations," Proc. Int'l Symp. Large Spatio-Temporal Databases (SSTD), 2003.
[22] D. Papadias, P. Kalnis, J. Zhang, and Y. Tao, "Efficient Olap Operations in Spatial Data Warehouses," Proc. Int'l Symp. Large Spatio-Temporal Databases (SSTD), 2001.
[23] J. Pei, B. Jiang, X. Lin, and Y. Yuan, "Probabilistic Skyline on Uncertain Data," Proc. Int'l Conf. Very Large Data Bases (VLDB), 2007.
[24] G.M. Siouris, Missile Guidance and Control Systems. Springer Publication, 2004.
[25] M.A. Soliman, I.F. Ilyas, and K.C. Chang, "Top-$k$ Query Processing in Uncertain Databases," Proc. Int'l Conf. Data Eng. (ICDE), 2007.
[26] Y. Tao, R. Cheng, X. Xiao, W.K. Ngai, B. Kao, and S. Prabhakar, "Indexing Multi-Dimensional Uncertain Data with Arbitrary Probability Density Functions," Proc. Int'l Conf. Very Large Data Bases (VLDB), 2005.
[27] Y. Tao and D. Papadias, "Range Aggregate Processing in Spatial Databases," IEEE Trans. Knowledge Data Eng., vol. 16, no. 12, pp. 1555-1570, Dec. 2004.
[28] Y. Tao, X. Xiao, and R. Cheng, "Range Search on Multidimensional Uncertain Data," ACM Trans. Database Systems, vol. 32, no. 3, pp. 1-54, 2007.
[29] S. Yang, W. Zhang, Y. Zhang, and X. Lin, "Probabilistic Threshold Range Aggregate Query Processing over Uncertain Data," Proc. Joint Int'l Conf. Advances in Data and Web Management (APWeb/WAIM), 2009.
[30] X. Yu and S. Mehrotra, "Capturing Uncertainty in Spatial Queries over Imprecise Data," Proc. Int'l Conf. Database and Expert Systems Applications(DEXA), 2003.
5 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool