Machine translation for Arabic dialects (survey)

Authors:

Highlights:

Abstract

Arabic dialects, also called colloquial Arabic or vernaculars, are spoken varieties of Standard Arabic. These dialects have mixed forms with many variations due to the influence of ancient local tongues and of other languages, such as European ones. Many of these dialects are mutually incomprehensible. Until recently, Arabic dialects were not written and were used only in spoken form. Nowadays, with the advent of the internet and mobile telephony, these dialects are increasingly used in written form, since this kind of communication has brought everyday conversations into a written format. This allows Arab people to use their dialects, which are their actual native languages, to express their opinions on social media, for chatting, texting, etc. This growing use opens a new research direction for Arabic natural language processing (NLP). In this paper, we focus on machine translation in the context of Arabic dialects. We provide a survey of recent research in this area. For each study, we report a detailed description of the adopted approach and give its most relevant contribution.

Keywords: Arabic dialect, Modern Standard Arabic, Machine translation

Review history: Received 31 October 2016, Revised 21 July 2017, Accepted 17 August 2017, Available online 31 August 2017, Version of Record 7 January 2019.

Official paper URL: https://doi.org/10.1016/j.ipm.2017.08.003