通过不确定性定量补充半监督学习

论文标题

通过不确定性定量补充半监督学习

Complementing Semi-Supervised Learning with Uncertainty Quantification

论文作者

Kazemi, Ehsan

论文摘要

完全监督分类的问题是，它需要大量的注释数据，但是，在许多数据集中，很大一部分数据是未标记的。为了减轻此问题，半监督学习（SSL）利用分类器在标记的域上的知识，并将其推送到无标记的域，该域具有与注释数据相似的分布。 SSL方法的最新成功至关重要地取决于阈值伪标记，从而对未标记的域的一致性正则化。但是，现有方法并未在训练过程中纳入伪标签或未标记样本的不确定性，这是由于嘈杂的标签或由于强大的增强而导致的分布样品。受SSL最近发展的启发，我们的目标是提出一个新颖的无监督不确定性感知的目标，依赖于核心和认识论不确定性量化。通过提出的不确定性感知损失函数来补充SSL中最近的技术，我们的方法表现优于标准SSL基准测试，而在计算上轻量轻量级也与最新的SSL基准相当。我们的结果在复杂数据集（例如CIFAR-100和MINI-IMAGENET）上的最新结果优于最先进的结果。

The problem of fully supervised classification is that it requires a tremendous amount of annotated data, however, in many datasets a large portion of data is unlabeled. To alleviate this problem semi-supervised learning (SSL) leverages the knowledge of the classifier on the labeled domain and extrapolates it to the unlabeled domain which has a supposedly similar distribution as annotated data. Recent success on SSL methods crucially hinges on thresholded pseudo labeling and thereby consistency regularization for the unlabeled domain. However, the existing methods do not incorporate the uncertainty of the pseudo labels or unlabeled samples in the training process which are due to the noisy labels or out of distribution samples owing to strong augmentations. Inspired by the recent developments in SSL, our goal in this paper is to propose a novel unsupervised uncertainty-aware objective that relies on aleatoric and epistemic uncertainty quantification. Complementing the recent techniques in SSL with the proposed uncertainty-aware loss function our approach outperforms or is on par with the state-of-the-art over standard SSL benchmarks while being computationally lightweight. Our results outperform the state-of-the-art results on complex datasets such as CIFAR-100 and Mini-ImageNet.

下载PDF全文

下载文献需遵守相关版权规定

论文标题