This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology
Induction of Semantic Classes Based on Coordinate Patterns
Lyon, France
August 22-August 27
ISBN: 978-0-7695-4513-4
Many NLP and IR applications require semantic classification knowledge of words. However, manually constructing semantic classes is a time-consuming and labor-intensive task. In this paper, we present an algorithm for induction of Chinese semantic classes from natural language text based on coordinate patterns. First, several coordinate patterns are proposed to harvest high-quality coordinate instance. Second, an iterative clustering process is used to cluster words into semantic classes. The clustering process mainly used coordinate relation between words. Experiment results show that the proposed approach performs relatively well and achieves 53.2% in terms of precision. Finally, a thesaurus containing about 15000 Chinese words is generated automatically.
Index Terms:
semantic class, coordinate structure, bottom-up clustering, language resource
Citation:
Likun Qiu, Yunfang Wu, Jing Shi, Yanqiu Shao, Zhiyi Long, "Induction of Semantic Classes Based on Coordinate Patterns," wi-iat, vol. 3, pp.201-204, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, 2011
Usage of this product signifies your acceptance of the Terms of Use.