论文标题
NLP偏见研究中的“性别”理论
Theories of "Gender" in NLP Bias Research
论文作者
论文摘要
对包含和永久性社会偏见的自然语言处理(NLP)技术的关注兴起,导致了丰富而快速增长的研究领域。性别偏见是正在分析的核心偏见之一,但迄今为止,人们对该领域中“性别”的方式尚无全面分析。我们调查了有关NLP中性别偏见的近200篇文章,以发现该领域如何明确地概念化性别(例如,通过术语的定义)和隐式(例如,通过在实践中进行性别运作)。为了更好地了解新兴的思想轨迹,我们将这些文章分为两个部分。 我们发现,大多数文章也不会明确地对性别理论化,即使它们明确定义了“偏见”。几乎没有人使用与非二进制性别相交或包含的性别模型;许多人以无视跨性别,非二元和双性恋者的存在和经验的方式将性特征,社会性别和语言性别混为一谈。在陈述的两个时间段之间,人们承认性别是一个复杂的现实,但是,很少有文章能够将此承认付诸实践。除了分析这些发现外,我们还提供了具体的建议,以促进跨学科工作,并纳入性别研究中的理论和方法。我们希望这将在NLP中产生更具包容性的性别偏见研究。
The rise of concern around Natural Language Processing (NLP) technologies containing and perpetuating social biases has led to a rich and rapidly growing area of research. Gender bias is one of the central biases being analyzed, but to date there is no comprehensive analysis of how "gender" is theorized in the field. We survey nearly 200 articles concerning gender bias in NLP to discover how the field conceptualizes gender both explicitly (e.g. through definitions of terms) and implicitly (e.g. through how gender is operationalized in practice). In order to get a better idea of emerging trajectories of thought, we split these articles into two sections by time. We find that the majority of the articles do not make their theorization of gender explicit, even if they clearly define "bias." Almost none use a model of gender that is intersectional or inclusive of nonbinary genders; and many conflate sex characteristics, social gender, and linguistic gender in ways that disregard the existence and experience of trans, nonbinary, and intersex people. There is an increase between the two time-sections in statements acknowledging that gender is a complicated reality, however, very few articles manage to put this acknowledgment into practice. In addition to analyzing these findings, we provide specific recommendations to facilitate interdisciplinary work, and to incorporate theory and methodology from Gender Studies. Our hope is that this will produce more inclusive gender bias research in NLP.