从域适应性的角度重新考虑长尾视觉识别的类平衡方法

论文标题

从域适应性的角度重新考虑长尾视觉识别的类平衡方法

Rethinking Class-Balanced Methods for Long-Tailed Visual Recognition from a Domain Adaptation Perspective

论文作者

Jamal, Muhammad Abdullah, Brown, Matthew, Yang, Ming-Hsuan, Wang, Liqiang, Gong, Boqing

论文摘要

现实世界中的对象频率通常遵循幂定律，从而导致机器学习模型看到的长尾类别分布的数据集之间的不匹配以及我们对模型在所有类别上都表现良好的期望。我们从域适应的角度分析了这一不匹配。首先，我们将现有的类平衡方法连接到用于长尾分类的现有平衡方法与目标变化，这是域适应性的良好情况。连接表明，这些方法隐含地假设训练数据和测试数据共享相同的类调节分布，这通常不存在，尤其是对于尾巴类别。虽然头等舱可能包含丰富而多样化的培训示例，这些例子很好地代表了推理时间的预期数据，但尾巴课程通常没有代表性的培训数据。为此，我们建议通过使用元学习方法明确估算班级条件分布之间的差异来增强经典的班级平衡学习。我们使用六个基准数据集和三个损失功能来验证我们的方法。

Object frequency in the real world often follows a power law, leading to a mismatch between datasets with long-tailed class distributions seen by a machine learning model and our expectation of the model to perform well on all classes. We analyze this mismatch from a domain adaptation point of view. First of all, we connect existing class-balanced methods for long-tailed classification to target shift, a well-studied scenario in domain adaptation. The connection reveals that these methods implicitly assume that the training data and test data share the same class-conditioned distribution, which does not hold in general and especially for the tail classes. While a head class could contain abundant and diverse training examples that well represent the expected data at inference time, the tail classes are often short of representative training data. To this end, we propose to augment the classic class-balanced learning by explicitly estimating the differences between the class-conditioned distributions with a meta-learning approach. We validate our approach with six benchmark datasets and three loss functions.

下载PDF全文

下载文献需遵守相关版权规定

论文标题