Paper Title

AI4D -- African Language Dataset Challenge

Authors

Siminyu, Kathleen; Freshia, Sackey; Abbott, Jade; Marivate, Vukosi

Abstract

As language and speech technologies become more advanced, the lack of fundamental digital resources for African languages, such as data, spell checkers and Part of Speech taggers, means that the digital divide between these languages and others keeps growing. This work details the organisation of the AI4D - African Language Dataset Challenge, an effort to incentivize the creation, organization and discovery of African language datasets through a competitive challenge. We particularly encouraged the submission of annotated datasets which can be used for training task-specific supervised machine learning models.
