The 4th International Symposium on Parallel and Distributed Computing (ISPDC'05)
Progressive Clustering for Database Distribution on a Grid
Universit? of Lille 1, France
July 04-July 06
ISBN: 0-7695-2434-6
The increasing availability of clusters and grids of workstations provides cheap and powerful ressources for distributed datamining. To exploit these ressources we need new algorithms adapted to this kind of environment, in particular with respect to the way to fragment data and to use this fragmentation. An "intelligent" distribution of data is required and can be obtained from clustering. Most existing parallel methods of clustering are developped for supercomputers with shared memory and hence can not be used on a Grid. This paper presents a new clustering algorithm, called Progressive Clustering, which executes a clustering in an efficient and incremental distributed way. The data clusters resulting from this algorithm can subsequently be used in distributed data mining tasks.
Citation:
Valerie FIOLET, Bernard TOURSEL, "Progressive Clustering for Database Distribution on a Grid," ispdc, pp.282-289, The 4th International Symposium on Parallel and Distributed Computing (ISPDC'05), 2005