The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.01 - January/February (2012 vol.32)
pp: 34-45
Jamal Alsakran , Kent State University
Yang Chen , University of North Carolina at Charlotte
Dongning Luo , University of North Carolina at Charlotte
Ye Zhao , Kent State University
Jing Yang , University of North Carolina at Charlotte
Wenwen Dou , University of North Carolina at Charlotte
Shixia Liu , Microsoft Research Asia
ABSTRACT
Streamit lets users explore visualizations of text streams without prior knowledge of the data. It incorporates incoming documents from a continuous source into an existing visualization context with automatic grouping and separation based on document similarities. Streamit generates document clusters to promote better understanding. To obtain different clusterings, users can adjust the keyword importance on the fly. Topic modeling represents the documents with higher-level semantic meanings. System performance has been optimized to achieve instantaneous animated visualization even for very large data collections. A powerful user interface allows in-depth data analysis. The video shows an example of applying our system on 1,000 US National Science Foundation Information and Intelligent Systems award abstracts funded between March 2000 and August 2003. The visual layout consists of a main window (left view), an animation control panel (bottom), control tools (top right), a keyword table (middle right), and document tables (bottom right). Documents are represented by pies whose size conveys the project's funding. The example shows how clusters of documents are generated and dynamically evolve (move, split, or merge) as new documents are inserted. The simulation places new documents relatively close to similar ones, creating clusters that each have an assigned color. Clusters maintain their colors, which facilitates the visual tracking of their behavior. However, when the system generates new clusters (for example, a cluster splits into two or more clusters), it assigns them unique colors to ease the visual tracking of them as they evolve. For example, in the video, the section from 00:21 to 00:25 shows how the red cluster splits into two clusters: a cluster that keeps the same red color and a new light-blue cluster. Finally, the spiral view (00:32–00:35) lets users examine the clusters' temporal trends.
INDEX TERMS
Streamit, data analysis, text analysis, keyword importance, document analysis, visual analytics, force-directed model, dynamic keyword importance, GPU acceleration, computer graphics
CITATION
Jamal Alsakran, Yang Chen, Dongning Luo, Ye Zhao, Jing Yang, Wenwen Dou, Shixia Liu, "Real-Time Visualization of Streaming Text with a Force-Based Dynamic System", IEEE Computer Graphics and Applications, vol.32, no. 1, pp. 34-45, January/February 2012, doi:10.1109/MCG.2011.91
REFERENCES
1. J. Alsakran et al., "Streamit: Dynamic Visualization and Interactive Exploration of Text Streams," Proc. 2011 IEEE Pacific Visualization Symp. (PacificVis 11), IEEE Press, 2011, pp. 131–138.
2. G. Salton and C. Buckley, "Term-Weighting Approaches in Automatic Text Retrieval," Information Processing and Management, vol. 24, no. 5, 1988, pp. 513–523.
3. D.M. Blei, A.Y. Ng, and M.I. Jordan, "Latent Dirichlet Allocation," J. Machine Learning Research, Jan. 2003, pp. 993–1022.
4. I.-S. Kang, T.-W. Kim, and K.-J. Li, "A Spatial Data Mining Method by Delaunay Triangulation," Proc. 5th ACM Int'l Workshop Advances in Geographic Information Systems (GIS 97), ACM Press, 1997, pp. 35–39.
5. J. Carlis and J. Konstan, "Interactive Visualization of Serial Periodic Data," Proc. 11th Ann. ACM Symp. User Interface Software and Technology, ACM Press, 1998, pp. 29–38.
6. L. Nyland, M. Harris, and J. Prins, "Fast N-Body Simulation with CUDA," CPU Gems 3, H. Nguyen ed., Addison-Wesley Professional, 2007, pp. 677–696; http.developer.nvidia.com/GPUGems3gpugems3_ch31.html .
17 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool