16th International Conference on Pattern Recognition (ICPR'02) - Volume 2
Fast Hierarchical Clustering Based on Compressed Data
Quebec City, QC, Canada
August 11-August 15
ISBN: 0-7695-1695-X
Clustering in data mining is the process of discovering groups in a dataset, in such way, that the similarity between the elements of the same cluster is maximum and between different clusters is minimal. Some algorithms attempt to group a representative sample of the whole dataset and later to perform a labeling process in order to group the rest of the original database. Other algorithms perform a pre-clustering phase and later apply some classic clustering algorithm in order to create the final clusters. We present a pre-clustering algorithm that not only provides good results and efficient optimization of main memory but it also is independent of the data input order. The efficiency of the proposed algorithm, is shown and a comparison of it with the pre-clustering BIRCH algorithm BIRCH [9].
Citation:
Erendira Rendon, Ricardo Barandela, "Fast Hierarchical Clustering Based on Compressed Data," icpr, vol. 2, pp.20216, 16th International Conference on Pattern Recognition (ICPR'02) - Volume 2, 2002