论文标题
验证问题的端到端方法:学习正确的距离
An end-to-end approach for the verification problem: learning the right distance
论文作者
论文摘要
在此贡献中,我们通过引入与编码器共同训练的参数伪距离来增强度量学习设置。因此,为学习的距离样模型的输出绘制了几种解释。我们首先显示它近似可用于假设检验的似然比,并且它进一步诱导了来自同一和不同类别的示例的联合分布之间的较大差异。评估是在验证设置下进行的,该验证设置包括确定示例集是否属于同一类,即使此类类是新颖的,并且从未在训练过程中向模型呈现。经验评估表明,这种方法定义了验证问题的端到端方法,能够比基于余弦相似性的简单得分手更好地获得性能,并且进一步优于广泛使用的下游分类器。与实际距离的公制学习相比,我们进一步观察到训练在拟议的方法下得到了大大简化,不需要复杂的方案即可收集成对的示例。
In this contribution, we augment the metric learning setting by introducing a parametric pseudo-distance, trained jointly with the encoder. Several interpretations are thus drawn for the learned distance-like model's output. We first show it approximates a likelihood ratio which can be used for hypothesis tests, and that it further induces a large divergence across the joint distributions of pairs of examples from the same and from different classes. Evaluation is performed under the verification setting consisting of determining whether sets of examples belong to the same class, even if such classes are novel and were never presented to the model during training. Empirical evaluation shows such method defines an end-to-end approach for the verification problem, able to attain better performance than simple scorers such as those based on cosine similarity and further outperforming widely used downstream classifiers. We further observe training is much simplified under the proposed approach compared to metric learning with actual distances, requiring no complex scheme to harvest pairs of examples.