This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
The impact of cluster representatives on the convergence of the K-modes type clustering
June 2013 (vol. 35 no. 6)
pp. 1509-1522
Liang Bai, Shaxi University, Shanxi and City University of Hong Kong, Hong Kong
Jiye Liang, Shaxi University, Shanxi
Chuangyin Dang, City University of Hong Kong, Hong Kong
Fuyuan Cao, Shaxi University, Shanxi
As a leading partitional clustering technique, $(k)$-modes is one of the most computationally efficient clustering methods for categorical data. In the $(k)$-modes, a cluster is represented by a “mode,” which is composed of the attribute value that occurs most frequently in each attribute domain of the cluster, whereas, in real applications, using only one attribute value in each attribute to represent a cluster may not be adequate as it could in turn affect the accuracy of data analysis. To get rid of this deficiency, several modified clustering algorithms were developed by assigning appropriate weights to several attribute values in each attribute. Although these modified algorithms are quite effective, their convergence proofs are lacking. In this paper, we analyze their convergence property and prove that they cannot guarantee to converge under their optimization frameworks unless they degrade to the original $(k)$--modes type algorithms. Furthermore, we propose two different modified algorithms with weighted cluster prototypes to overcome the shortcomings of these existing algorithms. We rigorously derive updating formulas for the proposed algorithms and prove the convergence of the proposed algorithms. The experimental studies show that the proposed algorithms are effective and efficient for large categorical datasets.
Index Terms:
Clustering algorithms,Prototypes,Algorithm design and analysis,Convergence,Optimization,Linear programming,Frequency measurement,convergence,Clustering,$(K)$-modes type clustering algorithms,categorical data,weighted cluster prototype
Citation:
Liang Bai, Jiye Liang, Chuangyin Dang, Fuyuan Cao, "The impact of cluster representatives on the convergence of the K-modes type clustering," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, no. 6, pp. 1509-1522, June 2013, doi:10.1109/TPAMI.2012.228
Usage of this product signifies your acceptance of the Terms of Use.