The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.06 - November/December (2009 vol.15)
pp: 993-1000
Sara Johansson , Norrköping Visualization and Interaction Studio, Linköping University, Sweden
ABSTRACT
Multivariate data sets including hundreds of variables are increasingly common in many application areas. Most multivariate visualization techniques are unable to display such data effectively, and a common approach is to employ dimensionality reduction prior to visualization. Most existing dimensionality reduction systems focus on preserving one or a few significant structures in data. For many analysis tasks, however, several types of structures can be of high significance and the importance of a certain structure compared to the importance of another is often task-dependent. This paper introduces a system for dimensionality reduction by combining user-defined quality metrics using weight functions to preserve as many important structures as possible. The system aims at effective visualization and exploration of structures within large multivariate data sets and provides enhancement of diverse structures by supplying a range of automatic variable orderings. Furthermore it enables a quality-guided reduction of variables through an interactive display facilitating investigation of trade-offs between loss of structure and the number of variables to keep. The generality and interactivity of the system is demonstrated through a case scenario.
INDEX TERMS
Dimensionality reduction, interactivity, quality metrics, variable ordering
CITATION
Sara Johansson, "Interactive Dimensionality Reduction Through User-defined Combinations of Quality Metrics", IEEE Transactions on Visualization & Computer Graphics, vol.15, no. 6, pp. 993-1000, November/December 2009, doi:10.1109/TVCG.2009.153
REFERENCES
[1] R. Agrawal, J. Gehrke, D. Gunopulos, and P. Raghavan, Automatic subspace clustering of high dimensional data for data mining applications. In ACM SIGMOD International Conference on Management of Data, pages 94–105, 1998.
[2] R. Agrawal and R. Srikant, Fast algorithms for mining association rules. In Proceedings of the 20th VLDB Conference, pages 487–499, 1994.
[3] M. Ankerst, S. Berchtold, and D. A. Keim, Similarity clustering for an enhanced visualization of multidimensional data. In Proceedings of IEEE Symposium on Information Visualization, InfoVis '98, pages 52–60, 1998.
[4] A. O. Artero, M. C. F. de Olivera, and H. Levkowitz, Enhanced high dimensional data visualization through dimension reduction and attribute arrangement. In Proceedings of the conference on Infomation Visualization IV'06, pages 707–712. IEEE, 2006.
[5] R. A. Becker and W. S. Cleveland, Brushing scatterplots. Technometrics, 29 (2): 127–142, May 1987.
[6] Q. Cui, M. O. Ward, E. A. Rundensteiner, and J. Yang, Measuring data abstraction quality in multiresolution visualization. IEEE Transactions on Visualization and Computer Graphics, 12 (5): 709–716, Oct. 2006.
[7] S. G. Eick and A. F. Karr, Visual scalability. Journal of Computational & Graphical Statistics, 11 (1): 22–43, 2002.
[8] D. Guo, Coordinating computational and visual approaches for interactive feature selection and multivariate clustering. Information Visualization, 2 (4): 232–246, 2003.
[9] A. Inselberg, The plane with parallel coordinates. The Visual Computer, 1 (4): 69–91, 1985.
[10] G. Ivosev, L. Burton, and R. Bonner, Dimensionality reduction and visualization in principal component analysis. Analytical Chemistry, 80 (13): 4933–4944, 2008.
[11] I. T. Jolliffe, Principal Component Analysis, 2 ed. Springer-Verlag, 2002.
[12] D. A. Keim, Designing pixel-oriented visualization techniques: Theory and applications. IEEE Transactions on Visualization and Computer Graphics, 6 (1): 59–78, 2000.
[13] T. Kohonen, The self-organizing map. Neurocomputing, 21 (1–3): 1–6, 1998.
[14] Y. Koren and L. Carmel, Robust linear dimensionality reduction. IEEE Transactions on Visualization and Computer Graphics, 10 (4): 459–470, 2004.
[15] J. B. Kruskal and M. Wish, Multidimensional Scaling. Sage Publications, 1978.
[16] H. Nagesh, S. Goil, and A. Choudhary, Adaptive grids for clustering massive data sets. In First Siam International Conference on Data Mining, 2001.
[17] F. Paulovich and R. Minghim, Hipp: A novel hierarchical point placement strategy and its application to the exploration of document collections. IEEE Transactions on Visualization and Computer Graphics, 14 (6): 1229–1236, 2008.
[18] W. Peng, M. O. Ward, and E. A. Rundensteiner, Clutter reduction in multi-dimensional data visualization using dimension reordering. In Proceedings of IEEE Symposium on Information Visualization, pages 89–96, Oct. 2004.
[19] R. Rao and S. K. Card, The table lens: merging graphical and symbolic representations in an interactive focus + context visualization for tabular information. In Proceedings of the SIGCHI conference on Human factors in computing systems: celebrating interdependence, pages 318–322. ACM, 1994.
[20] J. L. Rodgers and W. A. Nicewander, Thirteen ways to look at the correlation coefficient. The American Statistician, 42 (1): 59–66, Feb. 1988.
[21] J. Seo and B. Shneiderman, A rank-by-feature framework for unsupervised multidimensional data exploration using low dimensional projections. In Proceedings of IEEE Symposium on Information Visualization 2004, INFOVIS 2004, pages 65–72, October 2004.
[22] E. J. Wegman, Hyperdimensional data analysis using parallel coordinates. Journal of American Statistics Association, 85 (411): 664–675, 1990.
[23] L. Wilkinson, A. Anand, and R. Grossman, Graph-theoretic scagnostics. In Proceedings of IEEE Symposium on Information Visualization, InfoVis '05, pages 157–164, 2005.
[24] M. Williams and T. Munzner, Steerable, progressive multidimensional scaling. In Proceedings of IEEE Symposium on Information Visualization, pages 57–64, Oct. 2004.
[25] J. Yang, A. Patro, S. Huang, N. Mehta, M. O. Ward, and E. A. Rundensteiner, Value and relation display for interactive exploration of high dimensional datasets. In Proceedings of IEEE Symposium on Information Visualization 2004, INFOVIS 2004, pages 73–80, October 2004.
[26] J. Yang, W. Peng, M. O. Ward, and E. A. Rundensteiner, Interactive hierarchical dimension ordering, spacing and filtering for exploration of high dimensional datasets. In Proceedings of IEEE Symposium on Information Visualization, pages 105–112, 2003.
[27] J. Yang, M. O. Ward, and S. Huang, Visual hierarchical dimension reduction for exploration of high dimensional datasets. In Proceedings of Eurographics/IEEE TCVG Symposium on Visualization, pages 19–28, May 2003.
25 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool