This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology
Exploring Word Similarity to Improve Chinese Personal Name Disambiguation
Lyon, France
August 22-August 27
ISBN: 978-0-7695-4513-4
This paper presents an approach to the Chinese Personal Name Disambiguation (PND). The key to clustering is the similarity measure of context, which depends on the features selection and representation and calculation method. First HIT Tongyici Cilin (Extended) is introduced to Chinese PND to enhance the clustering effect. Exploration about more word similarity is also performed to alleviate the data sparseness. In this system, a HAC (Hierarchical Agglomerative Clustering) algorithm is adopted to cluster the mentions referring to a same person with features extracted from documents. The results show that the word similarity information is very helpful to improve the system's performance.
Index Terms:
Chinese PND, Word Similarity, Tongyici Cilin, HAC algorithm
Citation:
Xia Yang, Peng Jin, Wei Xiang, "Exploring Word Similarity to Improve Chinese Personal Name Disambiguation," wi-iat, vol. 3, pp.197-200, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, 2011
Usage of this product signifies your acceptance of the Terms of Use.