论文标题

WMT20的Ubiqus英语Inuktitut系统

The Ubiqus English-Inuktitut System for WMT20

论文作者

Hernandez, François, Nguyen, Vincent

论文摘要

本文介绍了Ubiqus对WMT20英语Inuktitut共享新闻翻译任务的提交。我们的主要系统,也是唯一的提交是基于多语言方法,共同培训了几种凝集性语言的变压器模型。从数据选择,准备和令牌化到质量评估,英语Inuktitut翻译任务在每个步骤都具有挑战性。出现了困难,这两者都因为Inuktitut语言的特殊性以及低资源的背景。

This paper describes Ubiqus' submission to the WMT20 English-Inuktitut shared news translation task. Our main system, and only submission, is based on a multilingual approach, jointly training a Transformer model on several agglutinative languages. The English-Inuktitut translation task is challenging at every step, from data selection, preparation and tokenization to quality evaluation down the line. Difficulties emerge both because of the peculiarities of the Inuktitut language as well as the low-resource context.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源