The Community for Technology Leaders
Green Image
This paper proposes a k\hbox{-}{\rm{means}} type clustering algorithm that can automatically calculate variable weights. A new step is introduced to the k\hbox{-}{\rm{means}} clustering process to iteratively update variable weights based on the current partition of data and a formula for weight calculation is proposed. The convergency theorem of the new clustering process is given. The variable weights produced by the algorithm measure the importance of variables in clustering and can be used in variable selection in data mining applications where large and complex real data are often involved. Experimental results on both synthetic and real data have shown that the new algorithm outperformed the standard k\hbox{-}{\rm{means}} type algorithms in recovering clusters in data.
Clustering, data mining, mining methods and algorithms, feature evaluation and selection.
Zichen Li, Michael K. Ng, Joshua Zhexue Huang, Hongqiang Rong, "Automated Variable Weighting in k-Means Type Clustering", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 27, no. , pp. 657-668, May 2005, doi:10.1109/TPAMI.2005.95
91 ms
(Ver 3.1 (10032016))