This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Sixth International Conference on Intelligent Systems Design and Applications (ISDA'06) Volume 1
Combining Multi-knowledge for Chinese Word Segmentation Disambiguation
Jinan, China
October 16-October 18
ISBN: 0-7695-2528-8
Ying Qin, Beijing University of Posts and Telecommunications, China
Suxiang Zhang, Beijing University of Posts and Telecommunications, China
Xiaojie Wang, Beijing University of Posts and Telecommunications, China
In the task of Chinese word segmentation, there are two main segmentation ambiguities, overlapping ambiguity and combination ambiguity. The paper analyzes properties of ambiguities and supposes multiknowledge approach to disambiguate. Multiknowledge refers to the knowledge from statistic of large corpus and syntactic, semantic or discourse information about ambiguous words. Class based Ngram and Maximum Entropy model are applied to combining multi-knowledge and disambiguation.
Citation:
Ying Qin, Suxiang Zhang, Xiaojie Wang, "Combining Multi-knowledge for Chinese Word Segmentation Disambiguation," isda, vol. 1, pp.551-556, Sixth International Conference on Intelligent Systems Design and Applications (ISDA'06) Volume 1, 2006
Usage of this product signifies your acceptance of the Terms of Use.