loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
First International Conference on Innovative Computing, Information and Control - Volume II (ICICIC'06)
A Comparative Study on Supervised and Unsupervised Learning Approaches for Multilingual Text Categorization
Beijing, China
August 30-September 01
ISBN: 0-7695-2616-0
Chung-Hong Lee, National Kaohsiung University of Applied Sciences, Taiwan
Hsin-Chang Yang, Chang Jung University, Taiwan
Ting-Chung Chen, National Kaohsiung University of Applied Sciences, Taiwan
Sheng-Min Ma, National Kaohsiung University of Applied Sciences, Taiwan
Recently users of internationally distributed information networks need tools and methods that will enable them to discover, retrieve and categorize relevant information, in whatever language and form it may have been stored. This drives a convergence of numerous interests from diverse research communities focusing on the issues related to multilingual text categorization. In this work we compare and evaluate the performance of the leading supervised and unsupervised approaches for multilingual text categorization by using various performance measures and standard document corpora. For simplicity, we selected Support Vector Machines (SVM) and Latent Semantic Indexing (LSI) techniques as representatives of supervised and unsupervised methods for multilingual text categorization, respectively. The preliminary results show that our platform models including both supervised and unsupervised learning methods have the potentials for multilingual text categorization.
Citation:
Chung-Hong Lee, Hsin-Chang Yang, Ting-Chung Chen, Sheng-Min Ma, "A Comparative Study on Supervised and Unsupervised Learning Approaches for Multilingual Text Categorization," icicic, vol. 2, pp.511-514, First International Conference on Innovative Computing, Information and Control - Volume II (ICICIC'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.