测量和利用多任务学习的转移

论文标题

测量和利用多任务学习的转移

Measuring and Harnessing Transference in Multi-Task Learning

论文作者

Fifty, Christopher, Amid, Ehsan, Zhao, Zhe, Yu, Tianhe, Anil, Rohan, Finn, Chelsea

论文摘要

多任务学习可以利用一项任务学习的信息，以使其他任务的培训受益。尽管有这种能力，但天真的配方经常降低性能，尤其是确定将受益于共同训练的任务仍然是一个具有挑战性的设计问题。在本文中，我们在整个培训中分析了跨任务的信息传输或转移的动态。具体而言，我们开发了一个相似性度量，可以量化任务之间的转移，并使用此数量来更好地了解多任务学习的优化动态，并提高整体学习绩效。在后一种情况下，我们提出了两种方法来利用我们的转移指标。第一个通过选择哪些任务应一起训练，而第二个任务在微观层面上进行训练，以在每个训练步骤中组合任务梯度，以在宏观级别运行。我们发现这些方法可以在三个有监督的多任务学习基准和一个多任务加强学习范式上对先前的工作进行显着改善。

Multi-task learning can leverage information learned by one task to benefit the training of other tasks. Despite this capacity, naive formulations often degrade performance and in particular, identifying the tasks that would benefit from co-training remains a challenging design question. In this paper, we analyze the dynamics of information transfer, or transference, across tasks throughout training. Specifically, we develop a similarity measure that can quantify transference among tasks and use this quantity to both better understand the optimization dynamics of multi-task learning as well as improve overall learning performance. In the latter case, we propose two methods to leverage our transference metric. The first operates at a macro-level by selecting which tasks should train together while the second functions at a micro-level by determining how to combine task gradients at each training step. We find these methods can lead to significant improvement over prior work on three supervised multi-task learning benchmarks and one multi-task reinforcement learning paradigm.

下载PDF全文

下载文献需遵守相关版权规定

论文标题