loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Fourth IEEE International Conference on Data Mining (ICDM'04)
Analysis of Consensus Partition in Cluster Ensemble
Brighton, United Kingdom
November 01-November 04
ISBN: 0-7695-2142-8
Alexander P. Topchy, Michigan State University, East Lansing
Martin H. C. Law, Michigan State University, East Lansing
Anil K. Jain, Michigan State University, East Lansing
Ana L. Fred, Instituto Superior Tecnico, Lisbon, Portugal
In combination of multiple partitions, one is usually interested in deriving a consensus solution with a quality better than that of given partitions. Several recent studies have empirically demonstrated improved accuracy of clustering ensembles on a number of artificial and real-world data sets. Unlike certain multiple supervised classifier systems, convergence properties of unsupervised clustering ensembles remain unknown for conventional combination schemes. In this paper we present formal arguments on the effectiveness of cluster ensemble from two perspectives. The first is based on a stochastic partition generation model related to re-labeling and consensus function with plurality voting. The second is to study the property of the "mean" partition of an ensemble with respect to a metric on the space of all possible partitions. In both the cases, the consensus solution can be shown to converge to a true underlying clustering solution as the number of partitions in the ensemble increases. This paper provides a rigorous justification for the use of cluster ensemble.
Citation:
Alexander P. Topchy, Martin H. C. Law, Anil K. Jain, Ana L. Fred, "Analysis of Consensus Partition in Cluster Ensemble," icdm, pp.225-232, Fourth IEEE International Conference on Data Mining (ICDM'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.