具有对抗性denoising训练的强大无监督神经机器翻译

论文标题

具有对抗性denoising训练的强大无监督神经机器翻译

Robust Unsupervised Neural Machine Translation with Adversarial Denoising Training

论文作者

Sun, Haipeng, Wang, Rui, Chen, Kehai, Lu, Xugang, Utiyama, Masao, Sumita, Eiichiro, Zhao, Tiejun

论文摘要

无监督的神经机器翻译（UNMT）最近对机器翻译社区引起了极大的兴趣。 UNMT的主要优点在于其简单的大型培训文本句子的易于收集，而其性能略高于监督的神经机器翻译，这需要在某些翻译任务上进行昂贵的带注释的翻译对。在大多数研究中，UMNT经过干净的数据训练，而无需考虑其对嘈杂数据的鲁棒性。但是，在实际情况下，通常存在收集的输入句子中的噪声，这会降低翻译系统的性能，因为UNMT对输入句子的小扰动很敏感。在本文中，我们首次明确考虑嘈杂的数据，以提高基于UNMT的系统的鲁棒性。首先，我们在训练句子中清楚地定义了两种类型的噪声，即单词噪声和单词顺序噪声，并从经验上研究了其在UNMT中的效果，然后我们提出了通过UNMT中的DeNosing过程的对抗性训练方法。几种语言对的实验结果表明，我们提出的方法在嘈杂的情况下大大提高了常规UNMT系统的鲁棒性。

Unsupervised neural machine translation (UNMT) has recently attracted great interest in the machine translation community. The main advantage of the UNMT lies in its easy collection of required large training text sentences while with only a slightly worse performance than supervised neural machine translation which requires expensive annotated translation pairs on some translation tasks. In most studies, the UMNT is trained with clean data without considering its robustness to the noisy data. However, in real-world scenarios, there usually exists noise in the collected input sentences which degrades the performance of the translation system since the UNMT is sensitive to the small perturbations of the input sentences. In this paper, we first time explicitly take the noisy data into consideration to improve the robustness of the UNMT based systems. First of all, we clearly defined two types of noises in training sentences, i.e., word noise and word order noise, and empirically investigate its effect in the UNMT, then we propose adversarial training methods with denoising process in the UNMT. Experimental results on several language pairs show that our proposed methods substantially improved the robustness of the conventional UNMT systems in noisy scenarios.

下载PDF全文

下载文献需遵守相关版权规定

论文标题