Learning Translation Templates from Bilingual Translation Examples

作者:Ilyas Cicekli, H. Altay Güvenir

摘要

A mechanism for learning lexical correspondences between two languages from sets of translated sentence pairs is presented. These lexical level correspondences are learned using analogical reasoning between two translation examples. Given two translation examples, the similar parts of the sentences in the source language must correspond to the similar parts of the sentences in the target language. Similarly, the different parts must correspond to the respective parts in the translated sentences. The correspondences between similarities and between differences are learned in the form of translation templates. A translation template is a generalized translation exemplar pair where some components are generalized by replacing them with variables in both sentences and establishing bindings between these variables. The learned translation templates are obtained by replacing differences or similarities by variables. This approach has been implemented and tested on a set of sample training datasets and produced promising results for further investigation.

论文关键词:exemplar based machine learning, example-based machine translation, corpus-based machine translation, templates

论文评审过程:

论文官网地址:https://doi.org/10.1023/A:1011270708487