L2B：学习为打击标签噪声的启动稳健模型

论文标题

L2B：学习为打击标签噪声的启动稳健模型

L2B: Learning to Bootstrap Robust Models for Combating Label Noise

论文作者

Zhou, Yuyin, Li, Xianhang, Liu, Fengze, Wei, Qingyue, Chen, Xuxi, Yu, Lequan, Xie, Cihang, Lungren, Matthew P., Xing, Lei

论文摘要

深度神经网络在表示学习方面取得了巨大成功。但是，当使用嘈杂的标签学习（LNL）时，它们可以轻松地过度合适，并且无法推广到新数据。本文介绍了一种简单有效的方法，名为“学习到引导程序（L2B）”，该方法使模型可以使用自己的预测自行引导自己，而不会受到错误的伪标签的不利影响。它通过动态调整实际观察和生成的标签之间的重要性以及通过元学习的不同样品之间的重要性来实现这一目标。与现有的实例重新加权方法不同，我们方法的关键在于一个新的多功能目标，可以同时进行隐式重新标记，从而导致重大改进而不会产生额外的成本。 L2B比基线方法提供了一些好处。它产生的更健壮的模型通过更有效地引导引导程序不太容易受到嘈杂标签的影响。它通过调整实例和标签的权重来更好地利用损坏实例中包含的有价值的信息。此外，L2B与现有的LNL方法兼容，并提供竞争结果，涵盖了自然和医学成像任务，包括合成和现实世界中的分类和分割。广泛的实验表明，我们的方法有效地减轻了嘈杂标签的挑战，通常需要很少至没有验证样本，并且已经概括为其他任务，例如图像分割。这不仅将其定位为对现有LNL技术的强大补充，而且强调了其实际适用性。代码和型号可在https://github.com/yuyinzhou/l2b上找到。

Deep neural networks have shown great success in representation learning. However, when learning with noisy labels (LNL), they can easily overfit and fail to generalize to new data. This paper introduces a simple and effective method, named Learning to Bootstrap (L2B), which enables models to bootstrap themselves using their own predictions without being adversely affected by erroneous pseudo-labels. It achieves this by dynamically adjusting the importance weight between real observed and generated labels, as well as between different samples through meta-learning. Unlike existing instance reweighting methods, the key to our method lies in a new, versatile objective that enables implicit relabeling concurrently, leading to significant improvements without incurring additional costs. L2B offers several benefits over the baseline methods. It yields more robust models that are less susceptible to the impact of noisy labels by guiding the bootstrapping procedure more effectively. It better exploits the valuable information contained in corrupted instances by adapting the weights of both instances and labels. Furthermore, L2B is compatible with existing LNL methods and delivers competitive results spanning natural and medical imaging tasks including classification and segmentation under both synthetic and real-world noise. Extensive experiments demonstrate that our method effectively mitigates the challenges of noisy labels, often necessitating few to no validation samples, and is well generalized to other tasks such as image segmentation. This not only positions it as a robust complement to existing LNL techniques but also underscores its practical applicability. The code and models are available at https://github.com/yuyinzhou/l2b.

下载PDF全文

下载文献需遵守相关版权规定

论文标题