校准分割网络具有基于保证金的标签平滑

论文标题

校准分割网络具有基于保证金的标签平滑

Calibrating Segmentation Networks with Margin-based Label Smoothing

论文作者

Murugesan, Balamurali, Liu, Bingyuan, Galdran, Adrian, Ayed, Ismail Ben, Dolz, Jose

论文摘要

尽管深层神经网络推动了视觉识别任务的不可否认的进展，但仍有最近的证据表明，这些模型的校准很差，从而导致过度自信的预测。最小化训练过程中跨熵损失的标准实践提升了与单热标签分配相匹配的预测软效果概率。然而，这产生了正确类别的较高效果激活，该类别明显大于其余的激活，这加剧了误解问题。分类文献的最新观察结果表明，预测的熵嵌入隐式或明确最大化的损失函数会产生最新的校准性能。尽管有这些发现，这些损失在校准医学图像分割网络的相关任务中的影响仍未得到探索。在这项工作中，我们提供了当前最新校准损失的统一约束优化的观点。具体而言，这些损失可以看作是线性惩罚（或拉格朗日术语）对logit距离施加平等约束的近似值。这表明了这种潜在的平等约束的重要局限性，随之而来的梯度不断地朝着非信息解决方案朝着非信息解决方案前进，这可能阻止在基于梯度的优化过程中达到模型的判别性能和校准之间的最佳折衷。根据我们的观察，我们提出了一个基于不平等约束的简单且灵活的概括，该概括在logit距离上施加了可控的余量。对各种公共医疗图像分割基准的全面实验表明，我们的方法在网络校准方面为这些任务设定了新的最新结果，而判别性能也得到了改善。

Despite the undeniable progress in visual recognition tasks fueled by deep neural networks, there exists recent evidence showing that these models are poorly calibrated, resulting in over-confident predictions. The standard practices of minimizing the cross entropy loss during training promote the predicted softmax probabilities to match the one-hot label assignments. Nevertheless, this yields a pre-softmax activation of the correct class that is significantly larger than the remaining activations, which exacerbates the miscalibration problem. Recent observations from the classification literature suggest that loss functions that embed implicit or explicit maximization of the entropy of predictions yield state-of-the-art calibration performances. Despite these findings, the impact of these losses in the relevant task of calibrating medical image segmentation networks remains unexplored. In this work, we provide a unifying constrained-optimization perspective of current state-of-the-art calibration losses. Specifically, these losses could be viewed as approximations of a linear penalty (or a Lagrangian term) imposing equality constraints on logit distances. This points to an important limitation of such underlying equality constraints, whose ensuing gradients constantly push towards a non-informative solution, which might prevent from reaching the best compromise between the discriminative performance and calibration of the model during gradient-based optimization. Following our observations, we propose a simple and flexible generalization based on inequality constraints, which imposes a controllable margin on logit distances. Comprehensive experiments on a variety of public medical image segmentation benchmarks demonstrate that our method sets novel state-of-the-art results on these tasks in terms of network calibration, whereas the discriminative performance is also improved.

下载PDF全文

下载文献需遵守相关版权规定

论文标题