论文标题
在降级方法中有偏见的风险
The risk of bias in denoising methods
论文作者
论文摘要
实验数据集的大小,范围和细节迅速增长,但是这些数据集的价值受到不需要的测量噪声的限制。因此,采用试图减少噪声并增强感兴趣信号的分析技术很容易。在本文中,我们提请人们注意脱氧方法可能引入偏见并导致不正确的科学推论的可能性。为了介绍我们的案例,我们首先回顾了偏见和差异的基本统计概念。去核技术通常会降低在重复测量中观察到的方差,但这可能是为了使偏见引起平均预期结果的代价。然后,我们进行了三个简单的模拟,提供了具体的例子,说明在日常情况下如何表现出偏见。这些模拟揭示了一些可能令人惊讶和违反直觉的发现:(i)不同的方法在减少差异方面同样有效,但有些可能会产生偏见,而另一些则没有,(ii)确定更好地恢复地面真理的方法不会保证没有偏见,(iii)偏见也可能存在感兴趣信号的特定属性知识。我们建议研究人员在部署重要研究数据的deno方法之前应该考虑并可能量化偏差。
Experimental datasets are growing rapidly in size, scope, and detail, but the value of these datasets is limited by unwanted measurement noise. It is therefore tempting to apply analysis techniques that attempt to reduce noise and enhance signals of interest. In this paper, we draw attention to the possibility that denoising methods may introduce bias and lead to incorrect scientific inferences. To present our case, we first review the basic statistical concepts of bias and variance. Denoising techniques typically reduce variance observed across repeated measurements, but this can come at the expense of introducing bias to the average expected outcome. We then conduct three simple simulations that provide concrete examples of how bias may manifest in everyday situations. These simulations reveal several findings that may be surprising and counterintuitive: (i) different methods can be equally effective at reducing variance but some incur bias while others do not, (ii) identifying methods that better recover ground truth does not guarantee the absence of bias, (iii) bias can arise even if one has specific knowledge of properties of the signal of interest. We suggest that researchers should consider and possibly quantify bias before deploying denoising methods on important research data.