First IEEE International Conference on Data Mining (ICDM'01)
An Agglomerative Hierarchical Clustering Using Partial Maximum Array and Incremental Similarity Computation Method
San Jose, California
November 29-December 02
ISBN: 0-7695-1119-8
As the tractable amount of data is growing in computer science area, fast clustering algorithm is being required because traditional clustering algorithms are not so feasible for very large and high dimensional data. Many studies have been reported for clustering of large database, but most of them circumvent this problem by using the approximation method to result in the deterioration of accuracy. In this paper, we propose a new clustering algorithm by means of partial maximum array, which can realize the agglomerative hierarchical clustering with the same accuracy to the brute-force algorithm and has O(N 2 ) time complexity. And we also present the incremental method of similarity computation which substitutes the scalar calculation for the time-consuming calculation of vector similarity. The experimental results show that clustering becomes significantly fast for large and high dimensional data.
Citation:
Sung Jung, Taek-Soo Kim, "An Agglomerative Hierarchical Clustering Using Partial Maximum Array and Incremental Similarity Computation Method," icdm, pp.265, First IEEE International Conference on Data Mining (ICDM'01), 2001