2009 Ninth IEEE International Conference on Data Mining Semi-supervised Density-Based Clustering Miami, Florida December 06-December 09 ISBN: 978-0-7695-3895-2
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDM.2009.143
Most of the effort in the semi-supervised clustering literature was devoted to variations of the K-means algorithm. In this paper we show how background knowledge can be used to bias a partitional density-based clustering algorithm. Our work describes how labeled objects can be used to help the algorithm detecting suitable density parameters for the algorithm to extract density-based clusters in specific parts of the feature space. Considering the set of constraints estabilished by the labeled dataset we show that our algorithm, called SSDBSCAN, automatically finds density parameters for each natural cluster in a dataset. Four of the most interesting characteristics of SSDBSCAN are that (1) it only requires a single, robust input parameter, (2) it does not need any user intervention, (3) it automaticaly finds the noise objects according to the density of the natural clusters and (4) it is able to find the natural cluster structure even when the density among clusters vary widely. The algorithm presented in this paper is evaluated with artificial and real-world datasets, demonstrating better results when compared to other unsupervised and semi-supervised density-based approaches.
Index Terms:
Semi-supervised, density-based clustering
Citation:
Levi Lelis, Jörg Sander, "Semi-supervised Density-Based Clustering," icdm, pp.842-847, 2009 Ninth IEEE International Conference on Data Mining, 2009 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||