Paper Title

User-Centric Gender Rewriting

Authors

Bashar Alhafni, Nizar Habash, Houda Bouamor

Abstract

In this paper, we define the task of gender rewriting in contexts involving two users (I and/or You): first and second grammatical persons with independent grammatical gender preferences. We focus on Arabic, a gender-marking, morphologically rich language. We develop a multi-step system that combines the positive aspects of both rule-based and neural rewriting models. Our results demonstrate the viability of this approach on a recently created corpus for Arabic gender rewriting, achieving 88.42 M2 F0.5 on a blind test set. Our proposed system improves over previous work on the first-person-only version of this task, with a 3.05 absolute increase in M2 F0.5. We demonstrate a use case of our gender rewriting system by using it to post-edit the output of a commercial MT system to provide personalized outputs based on the users' grammatical gender preferences. We make our code, data, and models publicly available.
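
For readers unfamiliar with the reported metric, the following is a minimal sketch of how an F0.5 score is obtained from edit-level counts. It is not the authors' evaluation code: the function name `f_beta`, the count-based interface, and the handling of empty edit sets are assumptions made for illustration. With beta = 0.5, precision is weighted more heavily than recall.

```python
def f_beta(tp: int, fp: int, fn: int, beta: float = 0.5) -> float:
    """F-beta from edit-level counts (hypothetical helper, not the paper's scorer).

    tp: proposed edits that match a gold edit
    fp: proposed edits that match no gold edit
    fn: gold edits that were not proposed
    """
    # Assumption: with no proposed edits precision is taken as 1.0,
    # and with no gold edits recall is taken as 1.0.
    precision = tp / (tp + fp) if tp + fp else 1.0
    recall = tp / (tp + fn) if tp + fn else 1.0
    if precision + recall == 0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)
```

For example, 8 correct edits, 2 spurious edits, and 8 missed edits give a precision of 0.8 and a recall of 0.5, so `f_beta(8, 2, 8)` is about 0.714, closer to the precision than a balanced F1 would be.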
