论文标题
EnergyMatch:半监督学习的基于能量的伪标记
EnergyMatch: Energy-based Pseudo-Labeling for Semi-Supervised Learning
论文作者
论文摘要
半监督学习(SSL)中最新的最新方法将一致性正规化与基于置信的伪标记结合在一起。为了获得高质量的伪标签,通常采用高置信度阈值。但是,已经表明,对于远离训练数据的样本,深网的基于软疗法的置信度得分可能是任意高的,因此,即使是高信心不明的样品,伪标签也可能是不可靠的。在这项工作中,我们提出了伪标记的新观点:而不是依靠模型信心,而是衡量未标记的样本是否可能是“分布”;即接近当前的培训数据。为了对未标记的样本进行分类,是“分布”还是“分发”,我们采用了分布外检测文献中的能量评分。随着培训的进行,更不标记的样本成为分配并有助于培训,标记和伪标记的数据的组合可以更好地近似于真正的分布以改善模型。实验表明,我们基于能量的伪标记方法,尽管在概念上很简单,但在不平衡的SSL基准测试方面明显优于基于置信度的方法,并在类平衡的数据上实现了竞争性能。例如,当失衡比高于50时,它会在CIFAR10-LT上产生4-6%的绝对准确性提高。与最先进的长尾SSL方法相结合时,可以实现进一步的改进。
Recent state-of-the-art methods in semi-supervised learning (SSL) combine consistency regularization with confidence-based pseudo-labeling. To obtain high-quality pseudo-labels, a high confidence threshold is typically adopted. However, it has been shown that softmax-based confidence scores in deep networks can be arbitrarily high for samples far from the training data, and thus, the pseudo-labels for even high-confidence unlabeled samples may still be unreliable. In this work, we present a new perspective of pseudo-labeling: instead of relying on model confidence, we instead measure whether an unlabeled sample is likely to be "in-distribution"; i.e., close to the current training data. To classify whether an unlabeled sample is "in-distribution" or "out-of-distribution", we adopt the energy score from out-of-distribution detection literature. As training progresses and more unlabeled samples become in-distribution and contribute to training, the combined labeled and pseudo-labeled data can better approximate the true distribution to improve the model. Experiments demonstrate that our energy-based pseudo-labeling method, albeit conceptually simple, significantly outperforms confidence-based methods on imbalanced SSL benchmarks, and achieves competitive performance on class-balanced data. For example, it produces a 4-6% absolute accuracy improvement on CIFAR10-LT when the imbalance ratio is higher than 50. When combined with state-of-the-art long-tailed SSL methods, further improvements are attained.