论文标题
作为守门人的单词:测量学术特定术语和含义的学术出版物
Words as Gatekeepers: Measuring Discipline-specific Terms and Meanings in Scholarly Publications
论文作者
论文摘要
学术文本通常充满术语或专业语言,可以促进田间内有效的小组内交流,但会阻碍对群体外的理解。在这项工作中,我们开发并验证一种可解释的方法来测量文本学术术语。扩大着专注于单词类型的先前工作范围,我们使用单词sense诱导来识别广泛但遍布各个领域的含义过多的单词。然后,我们估计这些学科特定的单词和感官在数百个子场中的流行率,并表明词感官提供了与单词类型一起的术语的互补性但独特的观点。我们通过强调两个关键的社会含义来证明我们对科学和计算社会语言学科学的指标的实用性。首先,尽管在为通用场地写作时,大多数领域都会减少对行话的使用,而某些领域(例如,生物科学)的作用比其他领域少。其次,术语与引文率之间的相关性方向在田野之间有所不同,但是术语几乎总是与跨学科影响负相关。从广义上讲,我们的发现表明,尽管多学科的场所打算迎合更多的一般受众,但某些领域的写作规范可能是障碍而不是桥梁,从而阻碍了学术思想的分散。
Scholarly text is often laden with jargon, or specialized language that can facilitate efficient in-group communication within fields but hinder understanding for out-groups. In this work, we develop and validate an interpretable approach for measuring scholarly jargon from text. Expanding the scope of prior work which focuses on word types, we use word sense induction to also identify words that are widespread but overloaded with different meanings across fields. We then estimate the prevalence of these discipline-specific words and senses across hundreds of subfields, and show that word senses provide a complementary, yet unique view of jargon alongside word types. We demonstrate the utility of our metrics for science of science and computational sociolinguistics by highlighting two key social implications. First, though most fields reduce their use of jargon when writing for general-purpose venues, and some fields (e.g., biological sciences) do so less than others. Second, the direction of correlation between jargon and citation rates varies among fields, but jargon is nearly always negatively correlated with interdisciplinary impact. Broadly, our findings suggest that though multidisciplinary venues intend to cater to more general audiences, some fields' writing norms may act as barriers rather than bridges, and thus impede the dispersion of scholarly ideas.