loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Measuring Data Abstraction Quality in Multiresolution Visualizations
September-October 2006 (vol. 12 no. 5)
pp. 709-716
Data abstraction techniques are widely used in multiresolution visualization systems to reduce visual clutter and facilitate analysis from overview to detail. However, analysts are usually unaware of how well the abstracted data represent the original dataset, which can impact the reliability of results gleaned from the abstractions. In this paper, we define two data abstraction quality measures for computing the degree to which the abstraction conveys the original dataset: the Histogram Difference Measure and the Nearest Neighbor Measure. They have been integrated within XmdvTool, a public-domain multiresolution visualization system for multivariate data analysis that supports sampling as well as clustering to simplify data. Several interactive operations are provided, including adjusting the data abstraction level, changing selected regions, and setting the acceptable data abstraction quality level. Conducting these operations, analysts can select an optimal data abstraction level. Also, analysts can compare different abstraction methods using the measures to see how well relative data density and outliers are maintained, and then select an abstraction method that meets the requirement of their analytic tasks.

[1] 709 S. Acharya, P. Gibbons, and V. Poosala, Congressional samples for approximate answering of group-by queries. Proc. ACM SIGMOD International Conference on Management of Data, pages 487–498, 2000.[2] C. Ahlberg and B. Shneiderman, Visual information seeking using the filmfinder. Proc. ACM SIGCHI Conference on Human Factors in Computing Systems, 2: 433, 1994.[3] B. Babcock, S. Chaudhuri, and G. Das, Dynamic sample selection for approximate query processing. Proc. ACM SIGMOD International Conference on Management of Data, pages 539–550, 2003.[4] B. Bederson, Pad++: Advances in multiscale interfaces. Proc. ACM SIGCHI Conference on Human Factors in Computing Systems, page 315, 1994.[5] P. Berkhin, Survey of clustering data mining techniques. Technical report, Accrue Software, San Jose, CA, 2002.[6] E. Bertini and G. Santucci, By chance is not enough: preserving relative density through nonuniform sampling. Eighth International Conference on Information Visualisation (IV'04), pages 622–629, 2004.[7] E. Bertini and G. Santucci, Quality metrics for 2d scatterplot graphics: automatically reducing visual clutter. Proc. of 4th International Symposium on SmartGraphics, pages 77–89, 2004.[8] F. Boutin and M. Hascoet, Cluster validity indices for graph partitioning. Eighth International Conference on Information Visualisation (IV'04), pages 376–381, 2004.[9] J. Bucher, The Metrology Handbook. ASQ Quality Press, Milwaukee, Wisc., 2004.[10] A. Dix and G. Ellis, By chance — enhancing interaction with large data sets through statistical sampling. Proc. Advanced Visual Interfaces, pages 167–176, 2002.[11] R. Duda, P. Hart, and D. Stork, Pattern classification. John Wiley and Sons, Inc., 2th edition, 2001.[12] Y. Fua, M. Ward, and E. Rundensteiner, Structure-based brushes: A mechanism for navigating hierarchically organized data and information spaces. IEEE Trans. on Visualization and Computer Graphics, 6 (2): 150–159, 2000.[13] M. Granitzer, W. Kienreich, V. Sabol, K. Andrews, and W. Klieber, Evaluating a system for interactive exploration of large, hierarchically structured document repositories. Proc. IEEE Symposium on Information Visualization, pages 127–134, 2004.[14] M. Kreuseler, N. Lopez, and H. Schumann, A scalable framework for information visualization. Proc. IEEE Symposium on Information Visualization, pages 27–36, 2000.[15] Y. Leung and M. Apperley, A review and taxonomy of distortion-oriented presentation techniques. ACM Transactions on Computer-Human Interaction, 1 (2): 126–160, 1994.[16] A. Martin and M. Ward, High dimensional brushing for interactive exploration of multivariate data. Proc. IEEE Visualization, pages 271–278, 1995.[17] W. Niblack, R. Barber, W. Equitz, M. Flickner, E. Glasman, D. Petkovic, P. Yanker, C. Faloutsos, and G. Taubin, The QBIC project: querying images by content using color, texture and shape. Proc. of SPIE Storage and Retrieval for Image and Video Databases, pages 173–187, 1993.[18] W. Peng, M. Ward, and E. Rundensteiner, Clutter reduction in multidimensional data visualization using dimension reordering. Proc. IEEE Symposium on Information Visualization, pages 89–96, 2004.[19] D. Rafiei and S. Curial, Effectively visualizing large networks through sampling. Proc. IEEE Visualization, pages 48–55, 2005.[20] E. Riskin, Optimal bit allocation via the generalized BFOS algorithm. IEEE Trans. on Information Theory, IT-37: 400–402, Mar. 1991.[21] R. Rosenholtz, Y. L., and J. Mansfield, and Z. Jin, Feature congestion: a measure of display clutter. Proc. ACM SIGCHI Conference on Human Factors in Computing Systems, pages 761–770, 2005.[22] D. Santa-Cruz, T. Ebrahimi, J. Askelof, M. Larsson, and C. Christopoulos, JPEG 2000 still image coding versus other standards. SPIE's 45th annual meeting, Applications of Digital Image Processing XXIII, 4115: 446–454, Aug 2000.[23] D. Scott, On optimal and data-based histograms. Biometrika, 66: 605–610, 1979.[24] S. Siggelkow, Feature Histograms for Content-Based Image Retrieval. Ph.D Thesis, University of Freiburg, Institute for Computer Science, Freiburg, Germany, 2002.[25] M. Swain and D. Ballard, Color indexing. International Journal of Computer Vision, 7: 11–32, 1991.[26] S. Thompson, Sampling. John Wiley and Sons, Inc., New York, 2th edition, 1992.[27] E. Tufte, The Visual Display of Quantitative Information. Graphics Press, Cheshire, CT, 1982.[28] J. Yang, M. Ward, and E. Rundensteiner, Hierarchical exploration of large multivariate data sets. Data Visualization: The State of the Art 2003, pages 201–212, 2003.

Index Terms:
Metrics, Clustering, Sampling, Multiresolution Visualization Authors 1:
Citation:
Qingguang Cui, Matthew Ward, Elke Rundensteiner, Jing Yang, "Measuring Data Abstraction Quality in Multiresolution Visualizations," IEEE Transactions on Visualization and Computer Graphics, vol. 12, no. 5, pp. 709-716, Sept. 2006, doi:10.1109/TVCG.2006.161
Usage of this product signifies your acceptance of the Terms of Use.