Fifth IEEE International Conference on Data Mining (ICDM'05)
Parameter-Free Spatial Data Mining Using MDL
Houston, Texas
November 27-November 30
ISBN: 0-7695-2278-5
Consider spatial data consisting of a set of binary features taking values over a collection of spatial extents (grid cells). We propose a method that simultaneously finds spatial correlation and feature co-occurrence patterns, without any parameters. In particular, we employ the Minimum Description Length (MDL) principle coupled with a natural way of compressing regions. This defines what "good" means: a feature co-occurrence pattern is good, if it helps us better compress the set of locations for these features. Conversely, a spatial correlation is good, if it helps us better compress the set of features in the corresponding region. Our approach is scalable for large datasets (both number of locations and of features). We evaluate our method on both real and synthetic datasets.
Citation:
Spiros Papadimitriou, Aristides Gionis, Panayiotis Tsaparas, Risto A. Väisänen, Heikki Mannila, Christos Faloutsos, "Parameter-Free Spatial Data Mining Using MDL," icdm, pp.346-353, Fifth IEEE International Conference on Data Mining (ICDM'05), 2005