论文标题
双论文:通过耦合引用的因果推断的简单框架
Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling
论文作者
论文摘要
研究过程包括许多决定,例如如何权利以及在何处发表论文。在本文中,我们介绍了一个通用框架,以调查此类决策的影响。研究效果的主要困难是我们需要知道反事实结果,而实际上并非现实。我们框架的关键见解是灵感来自现有的反事实分析,其中研究人员将双胞胎视为反事实单位。提出的框架将一对彼此引用为双胞胎的论文。这些论文往往是平行的作品,在类似的主题和类似社区中。我们研究了采用不同决策的双论文,观察这些研究带来的研究影响的进步,并通过这些研究的影响差异来估计决策的影响。我们发布了我们的代码和数据,我们认为由于反事实研究的数据集缺乏,我们认为这是非常有益的。
The research process includes many decisions, e.g., how to entitle and where to publish the paper. In this paper, we introduce a general framework for investigating the effects of such decisions. The main difficulty in investigating the effects is that we need to know counterfactual results, which are not available in reality. The key insight of our framework is inspired by the existing counterfactual analysis using twins, where the researchers regard twins as counterfactual units. The proposed framework regards a pair of papers that cite each other as twins. Such papers tend to be parallel works, on similar topics, and in similar communities. We investigate twin papers that adopted different decisions, observe the progress of the research impact brought by these studies, and estimate the effect of decisions by the difference in the impacts of these studies. We release our code and data, which we believe are highly beneficial owing to the scarcity of the dataset on counterfactual studies.