|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2009 Ninth IEEE International Conference on Data Mining
A New Clustering Algorithm Based on Regions of Influence with Self-Detection of the Best Number of Clusters
Miami, Florida
December 06-December 09
ISBN: 978-0-7695-3895-2
| ASCII Text | x | ||
| Fabrice Muhlenbach, Stéphane Lallich, "A New Clustering Algorithm Based on Regions of Influence with Self-Detection of the Best Number of Clusters," Data Mining, IEEE International Conference on, pp. 884-889, 2009 Ninth IEEE International Conference on Data Mining, 2009. | |||
| BibTex | x | ||
| @article{ 10.1109/ICDM.2009.133, author = {Fabrice Muhlenbach and Stéphane Lallich}, title = {A New Clustering Algorithm Based on Regions of Influence with Self-Detection of the Best Number of Clusters}, journal ={Data Mining, IEEE International Conference on}, volume = {0}, year = {2009}, issn = {1550-4786}, pages = {884-889}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICDM.2009.133}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Data Mining, IEEE International Conference on TI - A New Clustering Algorithm Based on Regions of Influence with Self-Detection of the Best Number of Clusters SN - 1550-4786 SP884 EP889 A1 - Fabrice Muhlenbach, A1 - Stéphane Lallich, PY - 2009 KW - clustering KW - neighborhood graph VL - 0 JA - Data Mining, IEEE International Conference on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDM.2009.133
Clustering methods usually require to know the best number of clusters, or another parameter, e.g. a threshold, which is not ever easy to provide. This paper proposes a new graph-based clustering method called GBC which detects automatically the best number of clusters, without requiring any other parameter. In this method based on regions of influence, a graph is constructed and the edges of the graph having the higher values are cut according to a hierarchical divisive procedure. An index is calculated from the size average of the cut edges which self-detects the more appropriate number of clusters. The results of GBC for 3 quality indices (Dunn, Silhouette and Davies-Bouldin) are compared with those of K-Means, Ward's hierarchical clustering method and DBSCAN on 8 benchmarks. The experiments show the good performance of GBC in the case of well separated clusters, even if the data are unbalanced, non-convex or with presence of outliers, whatever the shape of the clusters.
Index Terms:
clustering, neighborhood graph
Citation:
Fabrice Muhlenbach, Stéphane Lallich, "A New Clustering Algorithm Based on Regions of Influence with Self-Detection of the Best Number of Clusters," icdm, pp.884-889, 2009 Ninth IEEE International Conference on Data Mining, 2009
Usage of this product signifies your acceptance of the Terms of Use.
