Issue No. 06 - December (1996 vol. 8)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/69.553156
<p><b>Abstract</b>—In this paper, we study two spatial knowledge discovery problems involving proximity relationships between <it>clusters</it> and <it>features</it>. The first problem is: Given a cluster of points, how can we efficiently find features (represented as polygons) that are closest to the majority of points in the cluster? We measure proximity in an aggregate sense due to the nonuniform distribution of points in a cluster (e.g., houses on a map), and the different shapes and sizes of features (e.g., natural or man-made geographic features). The second problem is: Given <it>n</it> clusters of points, how can we extract the aggregate proximity commonalities (i.e., features) that apply to most, if not all, of the <it>n</it> clusters? Regarding the first problem, the main contribution of the paper is the development of Algorithm CRH which uses geometric approximations (i.e., circles, rectangles, and convex hulls) to filter and select features. Highly scalable and incremental, Algorithm CRH can examine over 50,000 features and their spatial relationships with a given cluster in approximately one second of CPU time. Regarding the second problem, the key contribution is the development of Algorithm GenCom that makes use of concept generalization to effectively derive many meaningful commonalities that cannot be found otherwise.</p>
Spatial knowledge discovery, concept generalization, proximity relationships, geometric filtering, GIS.
E. M. Knorr and R. T. Ng, "Finding Aggregate Proximity Relationships and Commonalities in Spatial Data Mining," in IEEE Transactions on Knowledge & Data Engineering, vol. 8, no. , pp. 884-897, 1996.