This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
An Approach to Active Spatial Data Mining Based on Statistical Information
September/October 2000 (vol. 12 no. 5)
pp. 715-728

Abstract—Spatial data mining presents new challenges due to the large size of spatial data, the complexity of spatial data types, and the special nature of spatial access methods. Most research in this area has focused on efficient query processing of static data. This paper introduces an active spatial data mining approach that extends the current spatial data mining algorithms to efficiently support user-defined triggers on dynamically evolving spatial data. To exploit the locality of the effect of an update and the nature of spatial data, we employ a hierarchical structure with associated statistical information at the various levels of the hierarchy and decompose the user-defined trigger into a set of subtriggers associated with cells in the hierarchy. Updates are suspended in the hierarchy until their cumulative effect might cause the trigger to fire. It is shown that this approach achieves three orders of magnitude improvement over the naive approach that reevaluate the condition over the database for each update, while both approaches produce the same result without any delay. Moreover, this scheme can support incremental query processing as well.

[1] R. Agrawal and G. Psaila, “Active Data Mining,” Proc. First Int'l Conf. Knowledge Discovery and Data Mining (KDD), pp. 3–8, 1995.
[2] M. Ankerst, B. Braunmeller, H.-P. Kriegel, and T. Seidl, “Improving Adaptable Similarity Query Processing by Using Approximations,” Proc. 24th Int'l Conf. Very Large Data Bases (VLDB), 1998.
[3] M. Ester, H.-P. Kriegel, J. Sander, and X. Xu, “A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise,” Proc. Second Int'l Conf. Knowledge Discovery and Data Mining (KDD), pp. 226–231, 1996.
[4] M. Ester, H.-P. Kriegel, and J. Sander, “Spatial Data Mining: A Database Approach,” Proc. Fifth Int'l Symp. Spatial Databases (SSD), pp. 47–66, 1997.
[5] M. Ester, H.-P. Kriegel, J. Sander, M. Wimmer, and X. Xu, “Incremental Clustering for Mining in a Data Warehousing Environment,” Proc. 24th Int'l Conf. Very Large Data Bases (VLDB), 1998.
[6] M. Ester, A. Frommelt, H.-P. Kriegel, and J. Sander, “Algorithms for Characterization and Trend Detection in Spatial Databases,” Proc. Fourth Int'l Conf. Knowledge Discovery and Data Mining (KDD), 1998.
[7] R. Feldman, Y. Aumann, A. Amir, and H. Mannila, “Efficient Algorithms for Discovering Frequent Sets in Incremental Databases,” Proc. ACM SIGMOD Workshop Research Issues on Data Mining and Knowledge Discovery (DMKD), pp. 59–66, 1997.
[8] A. Galton, “Continuous Change in Spatial Regions,” Proc. Third Int'l Conf. Spatial Information Theory (COSIT), pp. 1–13, 1997.
[9] S. Grumbach, P. Rigaux, and L. Segoufin, The DEDALE System for Complex Spatial Queries Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 213-224, June 1998.
[10] J. Han, K. Koperski, and N. Stefanovic, “GeoMiner: A System Prototype for Spatial Data Mining,” Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 553–556, 1997.
[11] K. Hornsby and M.J. Egenhofer, “Qualitative Representation of Change,” Proc. Third Int'l Conf. Spatial Information Theory (COSIT), pp. 15–33, 1997.
[12] E.M. Knorr and R.T. Ng, “Finding Aggregate Proximity Relationships and Commonalities in Spatial Data Mining,” IEEE Trans. Knowledge and Data Eng., vol. 8, no. 6, pp. 884–897, 1996.
[13] E.M. Knorr and R.T. Ng, “Extraction of Spatial Proximity Patterns by Concept Generalization,” Proc. Second Int'l Conf. Knowledge Discovery and Data Mining (KDD), pp. 347–350, 1996.
[14] E.M. Knorr, R.T. Ng, and D.L. Shilvock, “Finding Boundary Shape Matching Relationships in Spatial Data,” Proc. Fifth Int'l Symp. Spatial Databases (SSD), pp. 29–46, 1997.
[15] K. Koperski, J. Adhikary, and J. Han, “Spatial Data Mining: Progress and Challenges,” Proc. SIGMOD Workshop Research Issues on Data Mining and Knowledge Discovery (DMKD), 1996.
[16] R.T. Ng and J. Han, "Efficient and Effective Clustering Methods for Spatial Data Mining," Proc. 20th Int'l Conf. Very Large Databases, Morgan Kaufmann, 1994, pp. 144-155.
[17] W. Wang, J. Yang, and R.R. Muntz, "Sting: A Statistical Information Grid Approach to Spatial Data Mining," Proc. 23rd Int'l Conf. Very Large Databases, Morgan Kaufmann, 1997, pp. 186-195.
[18] J. Widom and S. Ceri, Active Database Systems. Morgan Kaufmann, 1996.
[19] O.R. Zaiane, J. Han, Z.-N. Li, S.H. Chee, and J. Chiang, “MultiMediaMiner: A System Prototype for Multimedia Data Mining,” Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 581–583, 1998.
[20] C. Zaniolo, S. Ceri, C. Faloutsos, R.T. Snodgrass, V.S. Subrahmanian, and R. Zicari, Advanced Database Systems. Morgan Kaufmann Publishers, 1997.
[21] T. Zhang, R. Ramakrishnan, and M. Livny, "Birch: An Efficient Data Clustering Method for Very Large Databases," Proc. ACM SIGMOD Int'l Conf. Management of Data, ACM Press, 1996, pp. 103-114.

Index Terms:
Spatial data mining, spatial databases, active data mining, incremental trigger evaluation.
Citation:
Wei Wang, Jiong Yang, Richard Muntz, "An Approach to Active Spatial Data Mining Based on Statistical Information," IEEE Transactions on Knowledge and Data Engineering, vol. 12, no. 5, pp. 715-728, Sept.-Oct. 2000, doi:10.1109/69.877504
Usage of this product signifies your acceptance of the Terms of Use.