This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Thirty-Second Annual Hawaii International Conference on System Sciences-Volume 2
Maui, Hawaii
January 05-January 08
ISBN: 0-7695-0001-3
Text Mining is an active area of research and development, which combines and expands techniques found in related areas like information retrieval, computational linguistics, and data mining to perform an analysis of large corpora of digital documents. This paper describes the TaxGen Text Mining project carried out at the IBM Software Development Lab. at Boeblingen, Germany. The goal of TaxGen was the automatic generation of a taxonomy for a collection of previously unstructured documents, namely a set of 73.000 news wire documents spanning one year.
Citation:
Adrian Miiller, Jochen Dorre, Peter Gerstl, Roland Seiffert, "The TaxGen Framework: Automating the Generation of a Taxonomy for a Large Document Collection," hicss, vol. 2, pp.2034, Thirty-Second Annual Hawaii International Conference on System Sciences-Volume 2, 1999
Usage of this product signifies your acceptance of the Terms of Use.