论文标题

加权经验风险最小化:基于重要性抽样的样本选择偏差校正

Weighted Empirical Risk Minimization: Sample Selection Bias Correction based on Importance Sampling

论文作者

Vogel, Robin, Achab, Mastane, Clémençon, Stéphan, Tillier, Charles

论文摘要

我们考虑统计学习问题,当培训观察的分配$ p'$ z'_1,\; \ ldots,\; Z'_n $与参与的分布$ p $不同,人们寻求最小化(称为测试分布),但仍在与$ p $相同的可测量空间上定义并主导它。 In the unrealistic case where the likelihood ratio $Φ(z)=dP/dP'(z)$ is known, one may straightforwardly extends the Empirical Risk Minimization (ERM) approach to this specific transfer learning setup using the same idea as that behind Importance Sampling, by minimizing a weighted version of the empirical risk functional computed from the 'biased' training data $Z'_i$ with weights $φ(z'_i)$。尽管重要性函数$φ(z)$通常在实践中是未知的,但我们表明,在实践中经常遇到的各种情况下,它采用了一种简单的形式,可以直接从$ z'_i $的s和统计人口$ p $的一些辅助信息中估算出来。通过线性化技术,我们证明,当将$φ(Z'_i)$的结果估计值插入加权经验风险时,可以保留上述方法的概括能力。除了这些理论保证之外,数值结果还提供了有力的经验证据,证明了本文促进的方法的相关性。

We consider statistical learning problems, when the distribution $P'$ of the training observations $Z'_1,\; \ldots,\; Z'_n$ differs from the distribution $P$ involved in the risk one seeks to minimize (referred to as the test distribution) but is still defined on the same measurable space as $P$ and dominates it. In the unrealistic case where the likelihood ratio $Φ(z)=dP/dP'(z)$ is known, one may straightforwardly extends the Empirical Risk Minimization (ERM) approach to this specific transfer learning setup using the same idea as that behind Importance Sampling, by minimizing a weighted version of the empirical risk functional computed from the 'biased' training data $Z'_i$ with weights $Φ(Z'_i)$. Although the importance function $Φ(z)$ is generally unknown in practice, we show that, in various situations frequently encountered in practice, it takes a simple form and can be directly estimated from the $Z'_i$'s and some auxiliary information on the statistical population $P$. By means of linearization techniques, we then prove that the generalization capacity of the approach aforementioned is preserved when plugging the resulting estimates of the $Φ(Z'_i)$'s into the weighted empirical risk. Beyond these theoretical guarantees, numerical results provide strong empirical evidence of the relevance of the approach promoted in this article.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源