Iteratively constrained selection of word alignment links using knowledge and statistics

作者:

Highlights:

摘要

Word alignment is a crucial component in applications that use bilingual resources. Statistical methods are widely used because they are portable and allow simple system building. However, pure statistical methods often incorrectly align functional words in the English–Korean language pair due to differences in the typology of the languages and a lack of knowledge. Knowledge is inevitably required to correct errors and to improve word alignment quality. In this paper, we introduce an effective method that uses an iterative process to incorporate knowledge into the word alignment system. The method achieved significant improvements in word alignment and its application: statistical machine translation.

论文关键词:Bilingual resource,Parallel text,Machine translation,Word alignment,Korean-English

论文评审过程:Received 9 May 2010, Revised 17 April 2011, Accepted 22 May 2011, Available online 27 May 2011.

论文官网地址:https://doi.org/10.1016/j.knosys.2011.05.012