Issue No.12 - Dec. (2011 vol.17)
David Gotz , IBM T.J. Watson Research Center
Jimeng Sun , IBM T.J. Watson Research Center
Huamin Qu , Hong Kong University of Science and Technology
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TVCG.2011.188
Clustering as a fundamental data analysis technique has been widely used in many analytic applications. However, it is often difficult for users to understand and evaluate multidimensional clustering results, especially the quality of clusters and their semantics. For large and complex data, high-level statistical information about the clusters is often needed for users to evaluate cluster quality while a detailed display of multidimensional attributes of the data is necessary to understand the meaning of clusters. In this paper, we introduce DICON, an icon-based cluster visualization that embeds statistical information into a multi-attribute display to facilitate cluster interpretation, evaluation, and comparison. We design a treemap-like icon to represent a multidimensional cluster, and the quality of the cluster can be conveniently evaluated with the embedded statistical information. We further develop a novel layout algorithm which can generate similar icons for similar clusters, making comparisons of clusters easier. User interaction and clutter reduction are integrated into the system to help users more effectively analyze and refine clustering results for large datasets. We demonstrate the power of DICON through a user study and a case study in the healthcare domain. Our evaluation shows the benefits of the technique, especially in support of complex multidimensional cluster analysis.
Visual Analysis, Clustering, Information Visualization.
David Gotz, Jimeng Sun, Huamin Qu, "DICON: Interactive Visual Analysis of Multidimensional Clusters", IEEE Transactions on Visualization & Computer Graphics, vol.17, no. 12, pp. 2581-2590, Dec. 2011, doi:10.1109/TVCG.2011.188