论文标题
确定性模型中准确和校准的不确定性的估计
Estimation of Accurate and Calibrated Uncertainties in Deterministic models
论文作者
论文摘要
在本文中,我们关注的问题是将不确定性分配给由输出连续变量的确定性模型产生的单点预测的问题。此问题适用于任何具有计算成本的最先进的物理或工程模型,该模型不容易允许运行合奏并估算与单点预测相关的不确定性。从本质上讲,我们设计了一种轻松将确定性预测转化为概率的方法。我们表明,这样做,必须在这种概率模型的准确性和可靠性(校准)之间妥协。因此,我们引入了一个编码其权衡的成本函数。我们使用连续的等级概率得分来衡量准确性,并在可靠性的情况下得出一个分析公式,如果预测以高斯分布表示的连续标量变量的预测。然后,通过解决两个目标优化问题,使用了新的准确性可靠性成本函数,以估算给定黑框均值函数的输入依赖性方差。这种策略背后的简单理念是,基于估计差异的预测不仅应该是准确的,而且应该是可靠的(即统计与观察一致)。相反,基于经典成本函数的最小化(例如负log概率密度)的早期作品不能同时执行准确性和可靠性。我们展示了几个示例,其中包括合成数据,可以准确地恢复潜在的隐藏噪声,并使用大型现实世界数据集恢复。
In this paper we focus on the problem of assigning uncertainties to single-point predictions generated by a deterministic model that outputs a continuous variable. This problem applies to any state-of-the-art physics or engineering models that have a computational cost that does not readily allow to run ensembles and to estimate the uncertainty associated to single-point predictions. Essentially, we devise a method to easily transform a deterministic prediction into a probabilistic one. We show that for doing so, one has to compromise between the accuracy and the reliability (calibration) of such a probabilistic model. Hence, we introduce a cost function that encodes their trade-off. We use the Continuous Rank Probability Score to measure accuracy and we derive an analytic formula for the reliability, in the case of forecasts of continuous scalar variables expressed in terms of Gaussian distributions. The new Accuracy-Reliability cost function is then used to estimate the input-dependent variance, given a black-box mean function, by solving a two-objective optimization problem. The simple philosophy behind this strategy is that predictions based on the estimated variances should not only be accurate, but also reliable (i.e. statistical consistent with observations). Conversely, early works based on the minimization of classical cost functions, such as the negative log probability density, cannot simultaneously enforce both accuracy and reliability. We show several examples both with synthetic data, where the underlying hidden noise can accurately be recovered, and with large real-world datasets.