loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2007 IEEE International Parallel and Distributed Processing Symposium
Scalable Visual Analytics of Massive Textual Datasets
Long Beach, CA, USA
March 26-March 30
ISBN: 1-4244-0909-8
M. Krishnan, Pacific Northwest National Laboratory, manoj@pnl.gov
S. Bohn, Pacific Northwest National Laboratory, shawn.bohn@pnl.gov
W. Cowley, Pacific Northwest National Laboratory, wendy@pnl.gov
V. Crow, Pacific Northwest National Laboratory, vern.crow@pnl.gov
J. Nieplocha, Pacific Northwest National Laboratory, jarek.nieplocha@pnl.gov
This paper describes the first scalable implementation of a text processing engine used in visual analytics tools. These tools aid information analysts in interacting with and understanding large textual information content through visual interfaces. By developing a parallel implementation of the text processing engine, we enabled visual analytics tools to exploit cluster architectures and handle massive datasets. The paper describes key elements of our parallelization approach and demonstrates virtually linear scaling when processing multi-gigabyte data sets such as Pubmed. This approach enables interactive analysis of large datasets beyond capabilities of existing state-of-the art visual analytics tools.
Citation:
M. Krishnan, S. Bohn, W. Cowley, V. Crow, J. Nieplocha, "Scalable Visual Analytics of Massive Textual Datasets," ipdps, pp.42, 2007 IEEE International Parallel and Distributed Processing Symposium, 2007
Usage of this product signifies your acceptance of the Terms of Use.