Interpolation-based Contrastive Learning for Few-Label Semi-Supervised Learning

Paper title

Interpolation-based Contrastive Learning for Few-Label Semi-Supervised Learning

Authors

Xihong Yang, Xiaochang Hu, Sihang Zhou, Xinwang Liu, En Zhu

Abstract

Semi-supervised learning (SSL) has long been shown to be an effective technique for building powerful models from limited labels. In the existing literature, consistency regularization-based methods, which force perturbed samples to have predictions similar to those of the original samples, have attracted much attention for their promising accuracy. However, we observe that the performance of such methods decreases drastically when labels become extremely limited, e.g., 2 or 3 labels per category. Our empirical study finds that the main problem lies in the drift of semantic information during data augmentation. The problem can be alleviated when enough supervision is provided. However, when little guidance is available, the incorrect regularization misleads the network and undermines the performance of the algorithm. To tackle the problem, we (1) propose an interpolation-based method to construct more reliable positive sample pairs; and (2) design a novel contrastive loss that guides the embedding of the learned network to change linearly between samples, improving the discriminative capability of the network by enlarging the margin of the decision boundaries. Since no destructive regularization is introduced, the performance of our proposed algorithm is largely improved. Specifically, the proposed algorithm outperforms the second-best algorithm (CoMatch) by 5.3%, achieving 88.73% classification accuracy when only two labels are available for each class on the CIFAR-10 dataset. Moreover, we further demonstrate the generality of the proposed method by considerably improving the performance of existing state-of-the-art algorithms with our proposed strategy.
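The two ideas in the abstract (interpolated positive pairs, and a contrastive loss that encourages embeddings to change linearly between samples) can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not give the loss formula, so the InfoNCE-style objective, the function names, the `temperature` value, and the toy linear encoder below are all assumptions chosen for illustration.

```python
import numpy as np


def l2_normalize(z, eps=1e-12):
    """Scale each row to unit length so dot products are cosine similarities."""
    return z / (np.linalg.norm(z, axis=1, keepdims=True) + eps)


def interpolation_contrastive_loss(encoder, x_a, x_b, lam, temperature=0.5):
    """Hypothetical loss: embedding of a mixed input vs. mix of embeddings.

    For each pair i, the positive is (embedding of the interpolated input,
    interpolation of the two embeddings); the other rows in the batch act
    as negatives. The loss is small when the encoder behaves linearly
    between samples.
    """
    # Interpolate in input space (mixup-style convex combination).
    x_mix = lam * x_a + (1.0 - lam) * x_b
    z_mix = l2_normalize(encoder(x_mix))

    # Interpolate in embedding space with the same coefficient.
    z_target = l2_normalize(lam * encoder(x_a) + (1.0 - lam) * encoder(x_b))

    # Pairwise cosine similarities scaled by the temperature (assumed value).
    logits = z_mix @ z_target.T / temperature

    # Numerically stable log-softmax over each row.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))

    # Positives sit on the diagonal (row i matches column i).
    return float(-np.mean(np.diag(log_prob)))


# Toy usage with random data and a linear map standing in for a network.
rng = np.random.default_rng(0)
W = rng.normal(size=(8, 4))
x_a = rng.normal(size=(16, 8))
x_b = rng.normal(size=(16, 8))
loss = interpolation_contrastive_loss(lambda x: x @ W, x_a, x_b, lam=0.7)
print(loss)
```

With a perfectly linear encoder, as in the toy usage, each positive pair has cosine similarity 1, so the loss stays below the chance level of log(batch size). A real network would only approach this through training.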
