loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2010 International Conference on Asian Language Processing
Sentence Similarity-Based Source Context Modelling in PBSMT
Harbin, Heilongjiang China
December 28-December 30
ISBN: 978-0-7695-4288-1
Target phrase selection, a crucial component of the state-of-the-art phrase-based statistical machine translation(PBSMT) model, plays a key role in generating accurate translation hypotheses. Inspired by context-rich word-sense disambiguation techniques, machine translation (MT) researchers have successfully integrated various types of source language context into the PBSMT model to improve target phrase selection. Among the various types of lexical and syntactic features, lexical syntactic descriptions in the form of super tags that preserve long-range word-to-word dependencies in a sentence have proven to be effective. These rich contextual features are able to disambiguate a source phrase, on the basis of the local syntactic behaviour of that phrase. In addition to local contextual information, global contextual information such as the grammatical structure of a sentence, sentence length and n-gram word sequences could provide additional important information to enhance this phrase-sense disambiguation. In this work, we explore various sentence similarity features by measuring similarity between a source sentence to be translated with the source-side of the bilingual training sentences and integrate them directly into the PBSMT model. We performed experiments on an English-to-Chinese translation task by applying sentence-similarity features both individually, and collaboratively with super tag-based features. We evaluate the performance of our approach and report a statistically significant relative improvement of 5.25% BLEU score when adding a sentence-similarity feature together with a super tag-based feature.
Index Terms:
sentence similarity, source context information, statistical machine translation
Citation:
Rejwanul Haque, Sudip Kumar Naskar, Andy Way, Marta R. Costa-jussà, Rafael E. Banchs, "Sentence Similarity-Based Source Context Modelling in PBSMT," ialp, pp.257-260, 2010 International Conference on Asian Language Processing, 2010
Usage of this product signifies your acceptance of the Terms of Use.