CMW-NET：学习班级意识的样本加权映射，以进行健壮的深度学习

论文标题

CMW-NET：学习班级意识的样本加权映射，以进行健壮的深度学习

CMW-Net: Learning a Class-Aware Sample Weighting Mapping for Robust Deep Learning

论文作者

Shu, Jun, Yuan, Xiang, Meng, Deyu, Xu, Zongben

论文摘要

现代深层神经网络可以轻松地过度融合到包含损坏标签或阶级失衡的有偏见的培训数据。样本重新加权方法通常用于减轻此数据偏差问题。但是，大多数当前的方法都需要手动预先指定加权方案，以及依赖于研究的问题和培训数据的特征的其他超参数。这使得它们很难在实际场景中通常应用于数据偏差情况的显着复杂性和类间变化。为了解决这个问题，我们提出了一种能够直接从数据中自适应学习明确的加权方案的元模型。具体而言，通过将每个培训类别视为一个单独的学习任务，我们的方法旨在提取具有样本丢失和任务/类功能作为输入的明确的加权功能，而样本权重作为输出，期望根据自己的内在偏见特征对不同的样本类别施加自适应变化的权重方案。合成和真实数据实验证实了我们方法在各种数据偏差案例中实现适当加权方案的能力，例如类不平衡，独立和依赖性标签噪声场景以及更复杂的偏置场景以外的传统案例。此外，还可以通过轻松部署在相对较小的CIFAR-10数据集中学到的加权功能来证实学习加权方案的任务转移性。与以前的SOAT相比，可以很容易地获得性能增长，而没有其他高参数调整和元梯度下降步骤。我们对于多种健壮的深度学习问题的总体可用性，包括部分标签学习，半监督学习和选择性分类，也已得到验证。

Modern deep neural networks can easily overfit to biased training data containing corrupted labels or class imbalance. Sample re-weighting methods are popularly used to alleviate this data bias issue. Most current methods, however, require to manually pre-specify the weighting schemes as well as their additional hyper-parameters relying on the characteristics of the investigated problem and training data. This makes them fairly hard to be generally applied in practical scenarios, due to their significant complexities and inter-class variations of data bias situations. To address this issue, we propose a meta-model capable of adaptively learning an explicit weighting scheme directly from data. Specifically, by seeing each training class as a separate learning task, our method aims to extract an explicit weighting function with sample loss and task/class feature as input, and sample weight as output, expecting to impose adaptively varying weighting schemes to different sample classes based on their own intrinsic bias characteristics. Synthetic and real data experiments substantiate the capability of our method on achieving proper weighting schemes in various data bias cases, like the class imbalance, feature-independent and dependent label noise scenarios, and more complicated bias scenarios beyond conventional cases. Besides, the task-transferability of the learned weighting scheme is also substantiated, by readily deploying the weighting function learned on relatively smaller-scale CIFAR-10 dataset on much larger-scale full WebVision dataset. A performance gain can be readily achieved compared with previous SOAT ones without additional hyper-parameter tuning and meta gradient descent step. The general availability of our method for multiple robust deep learning issues, including partial-label learning, semi-supervised learning and selective classification, has also been validated.

下载PDF全文

下载文献需遵守相关版权规定

论文标题