Database Engineering and Applications Symposium, International (2006)
Delhi, India
Dec. 11, 2006 to Dec. 14, 2006
ISSN: 1098-8068
ISBN: 0-7695-2577-6
pp: 309-310
P.R. Rao, DCST, Goa University, India
Jyoti Pawar, DCST, Goa University, India
The subspace clustering algorithm CLIQUE finds all subspace clusters, including overlapping clusters, that exist in high-dimensional datasets. CLIQUE consists of three main steps: (1) identification of subspaces that contain clusters, (2) identification of clusters, and (3) generation of the minimal description for the clusters obtained in step two. In this paper, we present a method for speeding up the first step of the CLIQUE algorithm. The proposed method accesses the data by columns instead of rows. It is particularly efficient when the high-dimensional dataset, given in the form of a table, contains many missing values. We also propose a depth-first method for finding the maximal dense units, which further improves the performance of the first step.
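The two ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and the abstract does not specify their data structures. The sketch assumes that each column is stored as a list of (row id, value) pairs holding only the non-missing entries, that every dimension is cut into equal-width intervals over a common range, and that a unit is dense when it contains at least `min_count` rows. The names `dense_units_1d`, `maximal_dense_units`, and `min_count` are hypothetical.

```python
from collections import defaultdict


def dense_units_1d(columns, n_bins, lo, hi, min_count):
    """Scan each column once and return its dense 1-D units.

    columns: dict mapping dimension -> list of (row_id, value) pairs.
    Missing values are simply absent, so they cost nothing to skip.
    Returns a dict mapping (dimension, interval) -> set of row ids.
    """
    width = (hi - lo) / n_bins
    units = {}
    for dim, entries in columns.items():
        bins = defaultdict(set)
        for row_id, value in entries:
            # Clamp so that value == hi falls into the last interval.
            interval = min(int((value - lo) / width), n_bins - 1)
            bins[interval].add(row_id)
        for interval, row_ids in bins.items():
            if len(row_ids) >= min_count:
                units[(dim, interval)] = row_ids
    return units


def maximal_dense_units(units_1d, min_count):
    """Depth-first search for dense units that have no dense superset.

    A unit is a tuple of (dimension, interval) pairs with distinct
    dimensions; its rows are the intersection of the 1-D row-id sets.
    """
    items = sorted(units_1d)  # ordered by dimension, then interval
    maximal = []

    def dfs(unit, row_ids, start):
        # Extend only with later items, so each unit is visited once.
        for i in range(start, len(items)):
            if items[i][0] == unit[-1][0]:
                continue  # one interval per dimension
            shared = row_ids & units_1d[items[i]]
            if len(shared) >= min_count:
                dfs(unit + [items[i]], shared, i + 1)
        # The unit is maximal if no unused dimension keeps it dense.
        used = {dim for dim, _ in unit}
        extendable = any(
            dim not in used and len(row_ids & rows) >= min_count
            for (dim, _), rows in units_1d.items()
        )
        if not extendable:
            maximal.append(tuple(unit))

    for i, key in enumerate(items):
        dfs([key], units_1d[key], i + 1)
    return maximal
```

In this layout a row with missing values contributes entries only to the columns where it has data, so the cost of the first pass grows with the number of stored values rather than with rows times dimensions. Intersecting row-id sets during the depth-first search avoids rescanning the table for each candidate unit.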
