The Community for Technology Leaders
RSS Icon
Issue No.06 - November/December (2009 vol.15)
pp: 945-952
Katerina Vrotsou , Norrköping Visualization and Interaction Studio, Linköping University
Jimmy Johansson , Norrköping Visualization and Interaction Studio, Linköping University
Matthew Cooper , Norrköping Visualization and Interaction Studio, Linköping University
The identification of significant sequences in large and complex event-based temporal data is a challenging problem with applications in many areas of today's information intensive society. Pure visual representations can be used for the analysis, but are constrained to small data sets. Algorithmic search mechanisms used for larger data sets become expensive as the data size increases and typically focus on frequency of occurrence to reduce the computational complexity, often overlooking important infrequent sequences and outliers. In this paper we introduce an interactive visual data mining approach based on an adaptation of techniques developed for web searching, combined with an intuitive visual interface, to facilitate user-centred exploration of the data and identification of sequences significant to that user. The search algorithm used in the exploration executes in negligible time, even for large data, and so no pre-processing of the selected data is required, making this a completely interactive experience for the user. Our particular application area is social science diary data but the technique is applicable across many other disciplines.
interactive visual exploration, event-based data, sequence identification, graph similarity, node similarity
Katerina Vrotsou, Jimmy Johansson, Matthew Cooper, "ActiviTree: Interactive Visual Exploration of Sequences in Event-Based Data Using Graph Similarity", IEEE Transactions on Visualization & Computer Graphics, vol.15, no. 6, pp. 945-952, November/December 2009, doi:10.1109/TVCG.2009.117
[1] A. Abbott and A. Tsay, Sequence analysis and optimal matching methods in sociology: Review and prospect. Sociological Methods Research, 29 (1): 3–33, 2000.
[2] R. Agrawal, T. Imielisnki, and A. Swami, Mining association rules between sets of items in large databases. In 1993 ACM-SIGMOD International Conference on Management of Data (SIGMOD'93), pages 207– 216, Washington, DC, 1993.
[3] R. Agrawal and R. Srikant, Mining sequential patterns. In P. S. Yu, and A. S. P. Chen editors, Eleventh International Conference on Data Engineering, pages 3–14, Taipei, Taiwan, 1995. IEEE Computer Society Press.
[4] V. Blondel, A. Gajardo, M. Heymans, P. Senellart, and P. V. Dooren, A measure of similarity between graph vertices: Applications to synonym extraction and web searching. SIAM Review, 46 (4): 647–666, 2004.
[5] R. Durbin, S. R. Eddy, A. Krogh, and G. Mitchison, Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge University Press, 1998.
[6] K. Ellegård, A time-geographical approach to the study of everyday life of individuals - a challenge of complexity. GeoJournal, 48 (3): 167–175, July 1999.
[7] K. Ellegård and K. Vrotsou, Capturing patterns of everyday life - presentation of the visualization method VISUAL-TimePAcTS. In IATUR XXVIII Annual Conference, Copenhagen, Denmark, August 2006.
[8] J. Fails, A. Karlson, L. Shahamat, and B. Shneiderman, A visual interface for multivariate temporal data: Finding patterns of events across multiple histories. In Proceedings of IEEE Symposium on Visual Analytics Science and Technology (VAST), October 31 - November 2 2006.
[9] P. Gatalsky, N. Andrienko, and G. Andrienko, Interactive analysis of event data using space-time cube. In Proceedings of the 8th International Conference on Information Visualization (IV'04), 2004.
[10] A. Godwin, R. Chang, R. Kosara, and W. Ribarsky, Visual data mining of unevenly-spaced event sequences. Interactive Poster in IEEE Symposium on Visual Analytics 2008, 2008.
[11] T. Hägerstrand, What about people in regional science? Papers in Regional Science, 24 (1): 6–21, December 1970.
[12] J. Han, H. Cheng, D. Xin, and X. Yan, Frequent pattern mining: current status and future directions. Data Mining and Knowledge Discovery, 15 (1): 55–86, 2007.
[13] J. M. Kleinberg, Authoritative sources in a hyperlinked environment. Journal of the ACM, 46: 668–677, 1999.
[14] M.-J. Kraak, The space-time cube revisited from a geovisualization perspective. In Proceedings of the 21st International Cartographic Conference, pages 1988–1995, Durban, South Africa, August 2003.
[15] M.-P. Kwan, GIS methods in time-geographic research: Geocomputation and geovisualization of human activity patterns. Geografiska Annaler: Series B, Human Geography, 86 (4): 267–280, December 2004.
[16] L. Page, S. Brin, R. Motwani, and T. Winograd, The pagerank citation ranking: Bringing order to the web. Stanford Digital Library Working Papers, Stanford, CA, 1998.
[17] J. Pei, J. Han, B. Mortazavi-asl, H. Pinto, Q. Chen, U. Dayal, and M.-c. Hsu, Prefixspan mining sequential patterns efficiently by prefix projected pattern growth. In 17th International Conference on Data Engineering, pages 215–226, 2001.
[18] C. Plaisant, B. Milash, A. Rose, S. Widoff, and B. Shneiderman, Life-lines: visualizing personal histories. In CHI '96: Proceedings of the SIGCHI conference on Human factors in computing systems, pages 221– ff. ACM, 1996.
[19] A. J. Pretorius and J. J. Van Wijk, Visual analysis of multivariate state transition graphs. IEEE Transactions on Visualization and Computer Graphics, 12 (5): 685–692, 2006.
[20] A. J. Pretorius and J. J. van Wijk, Visual inspection of multivariate graphs. Computer Graphics Forum, 27 : 967–974, 2008.
[21] U. Turdukulov, M. Kraak, and C. Blok, Designing a visual environment for exploration of time series of remote sensing data: In search for convective clouds. Computers & Graphics, 31 (3): 370–379, 2007.
[22] F. van Ham, H. van de Wetering, and J. van Wijk, Interactive visualization of state transition systems. Visualization and Computer Graphics, IEEE Transactions on, 8 (4): 319–329, Oct/Dec 2002.
[23] K. Vrotsou, K. Ellegård, and M. Cooper, Everyday life discoveries: Mining and visualizing activity patterns in social science diary data. In Proceedings of the 11th International Conference on Information Visualization, pages 130–138 Zörich, Switzerland, July 2007.
[24] T. D. Wang, C. Plaisant, A. J. Quinn, R. Stanchak, S. Murphy, and B. Shneiderman, Aligning temporal data by sentinel events: discovering patterns in electronic health records. In CHI '08: Proceeding of the twenty-sixth annual SIGCHI conference on Human factors in computing systems, pages 457–466. ACM, 2008.
[25] A. H. Youssefi, D. J. Duke, M. J. Zaki, and E. P. Glinert, Toward visual web mining. In Visual Data Mining Workshop IEEE International Conference on Data Mining, 2003.
[26] M. J. Zaki, SPADE: An efficient algorithm for mining frequent sequences. Machine Learning, 42 (1/2): 31–60, 2001.
[27] J. Zhao, P. Forer, and A. S. Harvey, Activities, ringmaps and geovisualization of large human movement fields. Information Visualization, 7 (3-4): 198–209, 2008.
21 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool