The Community for Technology Leaders
RSS Icon
Issue No.06 - November/December (2010 vol.16)
pp: 973-979
Hadley Wickham , Rice University
Dianne Cook , Iowa State University
Heike Hofmann , Iowa State University
Andreas Buja , Wharton School, University of Pennsylvania
How do we know if what we see is really there? When visualizing data, how do we avoid falling into the trap of apophenia where we see patterns in random noise? Traditionally, infovis has been concerned with discovering new relationships, and statistics with preventing spurious relationships from being reported. We pull these opposing poles closer with two new techniques for rigorous statistical inference of visual discoveries. The "Rorschach" helps the analyst calibrate their understanding of uncertainty and "line-up" provides a protocol for assessing the significance of visual discoveries, protecting against the discovery of spurious structure.
Statistics, visual testing, permutation tests, null hypotheses, data plot
Hadley Wickham, Dianne Cook, Heike Hofmann, Andreas Buja, "Graphical inference for infovis", IEEE Transactions on Visualization & Computer Graphics, vol.16, no. 6, pp. 973-979, November/December 2010, doi:10.1109/TVCG.2010.161
[1] A. Buja, D. Cook, H. Hofmann, M. Lawrence, E.-K. Lee, D. F. Swayne, and H. Wickham, "Statistical inference for exploratory data analysis and model diagnostics," Royal Society Philosophical Transactions A, vol. 367, no. 1906, pp. 4361–4383, 2009.
[2] E. L. Scott, C. D. Shane, and M. D. Swanson, "Comparison of the synthetic and actual distribution of galaxies on a photographic plate," Astrophysical Journal, vol. 119, pp. 91–112, Jan. 1954.
[3] A. M. Noll, "Human or machine: A subjective comparison of piet mondrian's "composition with lines" (1917) and a computer-generated picture," The Psychological Record, vol. 16,, pp. 1–10, 1966.
[4] C. Daniel, Applications of Statistics to Industrial Experimentation. Hoboken, NJ: Wiley-Interscience, 1976.
[5] P. Diaconis, "Theories of data analysis: From magical thinking through classical statistics," in Exploring Data Tables, Trends and Shapes ( D. Hoaglin, F. Mosteller, and J. Tukey eds.), pp. 1–36, New York: Wiley, 1983.
[6] A. C. Davison, and D. V. Hinkley, Bootstrap Methods and their Applications. Cambridge, UK: Cambridge University Press, 1997.
[7] A. Buja, D. Asimov, C. Hurley, and J. A. McDonald, "Elements of a viewing pipeline for data analysis," in Dynamic Graphics for Statistics, Wadsworth, Inc., 1988.
[8] J. M. Wolfe, M. J. V. Wert, "Varying target prevalence reveals two dissociable decision criteria in visual search," Current Biology, vol. 20, no. 2, pp. 121–124, 2010.
[9] E. J. G. Pitman, "Significance tests which may be applied to samples from any populations," The Journal of the Royal Statistical Society, vol. 4, pp. 119–130, 1937.
[10] P. Good, Permutation, Parametric, and Bootstrap Tests of Hypotheses. New York: Springer, 2005.
[11] F. Viègas, M. Wattenberg, F. van Ham, J. Kriss, and M. McKeon, "Manyeyes: A site for visualization at internet scale," Transactions on Visualization and Computer Graphics, vol. 13, pp. 1121–1128, 2007.
[12] Ø. Langsrud, "Rotation tests," Statistics and Computing, vol. 15, no. 1, pp. 53–60, 2005.
[13] W. S. Cleveland and R. McGill, "Graphical perception: Theory, experimentation and application to the development of graphical methods.," Journal of the American Statistical Association, vol. 79, no. 387, pp. 531–554, 1984.
[14] C. Healey, "Perception in visualisation," 2009.
[15] R Development Core Team, R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, 2009. ISBN 3-900051-07-0.
[16] Amazon, "Mechanical Turk," 2008.
[17] H. Wickham, ggplot2: Elegant graphics for data analysis. useR, Springer, July 2009.
429 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool