神经机器翻译的自定进度学习

论文标题

神经机器翻译的自定进度学习

Self-Paced Learning for Neural Machine Translation

论文作者

Wan, Yu, Yang, Baosong, Wong, Derek F., Zhou, Yikai, Chao, Lidia S., Zhang, Haibo, Chen, Boxing

论文摘要

最近的研究证明，神经机器翻译（NMT）的训练可以通过模仿人类的学习过程来促进。然而，这种课程学习的成就取决于手工制作的功能（例如句子长度或单词稀有性。我们通过提出自定进度的学习来改善此过程，在此过程中，NMT模型可以自动量化对培训示例的学习信心； 2）通过调节每个迭代步骤的损失来灵活控制其学习。对多个翻译任务的实验结果表明，所提出的模型比强基础和强大的基准和那些在翻译质量和收敛速度上接受人设计的课程训练的模型更好的性能。

Recent studies have proven that the training of neural machine translation (NMT) can be facilitated by mimicking the learning process of humans. Nevertheless, achievements of such kind of curriculum learning rely on the quality of artificial schedule drawn up with the handcrafted features, e.g. sentence length or word rarity. We ameliorate this procedure with a more flexible manner by proposing self-paced learning, where NMT model is allowed to 1) automatically quantify the learning confidence over training examples; and 2) flexibly govern its learning via regulating the loss in each iteration step. Experimental results over multiple translation tasks demonstrate that the proposed model yields better performance than strong baselines and those models trained with human-designed curricula on both translation quality and convergence speed.

下载PDF全文

下载文献需遵守相关版权规定

论文标题