论文标题
语言模型的保守程度如何?适应性别中立代词的引入
How Conservative are Language Models? Adapting to the Introduction of Gender-Neutral Pronouns
论文作者
论文摘要
最近,以多种语言将性别中立代词引入了a)包括非二进制人和b)作为通用的单数。心理语言学的最新结果表明,性别中性代词(瑞典语)与人类加工困难无关。我们表明,这与自动处理形成鲜明对比。我们表明,丹麦语,英语和瑞典语中的性别中立代词与更高的困惑,更多分散的注意力模式以及下游表现较差有关。我们认为,语言模型中的这种保守性可能会限制性别中立代词的广泛采用,因此必须解决。
Gender-neutral pronouns have recently been introduced in many languages to a) include non-binary people and b) as a generic singular. Recent results from psycholinguistics suggest that gender-neutral pronouns (in Swedish) are not associated with human processing difficulties. This, we show, is in sharp contrast with automated processing. We show that gender-neutral pronouns in Danish, English, and Swedish are associated with higher perplexity, more dispersed attention patterns, and worse downstream performance. We argue that such conservativity in language models may limit widespread adoption of gender-neutral pronouns and must therefore be resolved.