论文标题

克里奥尔人想要什么,克里奥尔人需要什么

What a Creole Wants, What a Creole Needs

论文作者

Lent, Heather, Ogueji, Kelechi, de Lhoneux, Miryam, Ahia, Orevaoghene, Søgaard, Anders

论文摘要

近年来,自然语言处理(NLP)社区已经越来越关注针对低资源的努力的努力差异。纠正此三角洲的努力通常始于将现有英语数据集翻译成其他语言。但是,这种方法忽略了不同的语言社区有不同的需求。我们考虑一组低资源语言,克里奥尔语。 NLP文献中基本上缺乏克里奥尔语,尽管这些语言具有相当大且充满活力的社区,但由于污名化,社会也经常被整个社会所忽略。我们通过与克里奥尔专家的对话和讲克里奥尔语的社区的调查来证明,语言技术所需的事情如何从一种语言变成另一种语言,即使语言被认为是彼此非常相似的,就像克里奥尔语一样。我们讨论了这些对话引起的突出主题,并最终证明,如果不涉及相关社区,就无法建立有用的语言技术。

In recent years, the natural language processing (NLP) community has given increased attention to the disparity of efforts directed towards high-resource languages over low-resource ones. Efforts to remedy this delta often begin with translations of existing English datasets into other languages. However, this approach ignores that different language communities have different needs. We consider a group of low-resource languages, Creole languages. Creoles are both largely absent from the NLP literature, and also often ignored by society at large due to stigma, despite these languages having sizable and vibrant communities. We demonstrate, through conversations with Creole experts and surveys of Creole-speaking communities, how the things needed from language technology can change dramatically from one language to another, even when the languages are considered to be very similar to each other, as with Creoles. We discuss the prominent themes arising from these conversations, and ultimately demonstrate that useful language technology cannot be built without involving the relevant community.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源