关系替代损失学习

论文标题

关系替代损失学习

Relational Surrogate Loss Learning

论文作者

Huang, Tao, Li, Zekang, Lu, Hua, Shan, Yong, Yang, Shusheng, Feng, Yang, Wang, Fei, You, Shan, Xu, Chang

论文摘要

机器学习中的评估指标通常几乎不被视为损失函数，因为它们可能是不可差的且不可分配的，例如平均精度和F1分数。本文旨在通过重新审视替代损失学习来解决这个问题，在该学习中，使用深层神经网络来近似评估指标。我们没有通过深层的神经网络追求评估指标的精确恢复，而是想起了这些评估指标的存在的目的，即区分一个模型是比另一个模型更好还是更糟。在本文中，我们表明，直接维持替代损失和指标之间模型之间的关系，并提出了一种基于等级相关的优化方法，以最大化这种关系并学习替代损失。与以前的作品相比，我们的方法更容易优化并享有显着的效率和性能提高。广泛的实验表明，我们的方法在包括图像分类和神经机器翻译在内的各种任务上取得了改进，甚至超过了人类姿势估计和机器阅读理解任务的最先进方法。代码可在以下网址获得：https：//github.com/hunto/reloss。

Evaluation metrics in machine learning are often hardly taken as loss functions, as they could be non-differentiable and non-decomposable, e.g., average precision and F1 score. This paper aims to address this problem by revisiting the surrogate loss learning, where a deep neural network is employed to approximate the evaluation metrics. Instead of pursuing an exact recovery of the evaluation metric through a deep neural network, we are reminded of the purpose of the existence of these evaluation metrics, which is to distinguish whether one model is better or worse than another. In this paper, we show that directly maintaining the relation of models between surrogate losses and metrics suffices, and propose a rank correlation-based optimization method to maximize this relation and learn surrogate losses. Compared to previous works, our method is much easier to optimize and enjoys significant efficiency and performance gains. Extensive experiments show that our method achieves improvements on various tasks including image classification and neural machine translation, and even outperforms state-of-the-art methods on human pose estimation and machine reading comprehension tasks. Code is available at: https://github.com/hunto/ReLoss.

下载PDF全文

下载文献需遵守相关版权规定

论文标题