CSDL Home IEEE Transactions on Pattern Analysis & Machine Intelligence 2007 vol.29 Issue No.03 - March
Issue No.03 - March (2007 vol.29)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPAMI.2007.53
This correspondence describes extensions to the k-modes algorithm for clustering categorical data. By modifying a simple matching dissimilarity measure for categorical objects, a heuristic approach was developed in ,  which allows the use of the k-modes paradigm to obtain a cluster with strong intrasimilarity and to efficiently cluster large categorical data sets. The main aim of this paper is to rigorously derive the updating formula of the k-modes clustering algorithm with the new dissimilarity measure and the convergence of the algorithm under the optimization framework.
Data mining, clustering, k-modes algorithm, categorical data.
Michael K. Ng, Mark Junjie Li, Joshua Zhexue Huang, Zengyou He, "On the Impact of Dissimilarity Measure in k-Modes Clustering Algorithm", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.29, no. 3, pp. 503-507, March 2007, doi:10.1109/TPAMI.2007.53