Issue No. 06 - November/December (2010 vol. 16)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TVCG.2010.216
Tuan Pham , Oregon State University
Rob Hess , Oregon State University
Crystal Ju , Oregon State University
Eugene Zhang , Oregon State University
Ronald Metoyer , Oregon State University
Understanding the diversity of a set of multivariate objects is an important problem in many domains, including ecology, college admissions, investing, machine learning, and others. However, to date, very little work has been done to help users achieve this kind of understanding. Visual representation is especially appealing for this task because it offers the potential to allow users to efficiently observe the objects of interest in a direct and holistic way. Thus, in this paper, we attempt to formalize the problem of visualizing the diversity of a large (more than 1000 objects), multivariate (more than 5 attributes) data set as one worth deeper investigation by the information visualization community. In doing so, we contribute a precise definition of diversity, a set of requirements for diversity visualizations based on this definition, and a formal user study design intended to evaluate the capacity of a visual representation for communicating diversity information. Our primary contribution, however, is a visual representation, called the Diversity Map, for visualizing diversity. An evaluation of the Diversity Map using our study design shows that users can judge elements of diversity consistently and as or more accurately than when using the only other representation specifically designed to visualize diversity.
information visualization, diversity, categorical data, multivariate data, evaluation
T. Pham, C. Ju, R. Hess, E. Zhang and R. Metoyer, "Visualization of Diversity in Large Multivariate Data Sets," in IEEE Transactions on Visualization & Computer Graphics, vol. 16, no. , pp. 1053-1062, 2010.