通过隐式sindhorn差异重新巴辛

论文标题

通过隐式sindhorn差异重新巴辛

Re-basin via implicit Sinkhorn differentiation

论文作者

Peña, Fidel A. Guerrero, Medeiros, Heitor Rapela, Dubail, Thomas, Aminbeidokhti, Masih, Granger, Eric, Pedersoli, Marco

论文摘要

新的新算法将模型置于解决方案空间的功能等效区域的最新出现使误差表面的复杂性以及一些有希望的属性（例如模式连接性）揭示了一些启示。但是，找到正确的置换是具有挑战性的，当前的优化技术是无法差异的，这使得很难集成到基于梯度的优化中，并且通常会导致次优的解决方案。在本文中，我们提出了一个sndhorn re-basin网络，具有获得更适合给定目标的运输计划的能力。与当前的最新方法不同，我们的方法是可区分的，因此易于适应深度学习领域内的任何任务。此外，我们通过提出一种新的成本函数来展示我们的重生方法的优势，该功能允许通过利用线性模式连接属性来执行增量学习。在最佳传输发现和线性模式连接性的几种条件下，将我们方法的好处与文献的类似方法进行了比较。对于几个常见的基准数据集，还显示了基于重生的连续学习方法的有效性，从而提供了与文献的最先进结果竞争的实验结果。

The recent emergence of new algorithms for permuting models into functionally equivalent regions of the solution space has shed some light on the complexity of error surfaces, and some promising properties like mode connectivity. However, finding the right permutation is challenging, and current optimization techniques are not differentiable, which makes it difficult to integrate into a gradient-based optimization, and often leads to sub-optimal solutions. In this paper, we propose a Sinkhorn re-basin network with the ability to obtain the transportation plan that better suits a given objective. Unlike the current state-of-art, our method is differentiable and, therefore, easy to adapt to any task within the deep learning domain. Furthermore, we show the advantage of our re-basin method by proposing a new cost function that allows performing incremental learning by exploiting the linear mode connectivity property. The benefit of our method is compared against similar approaches from the literature, under several conditions for both optimal transport finding and linear mode connectivity. The effectiveness of our continual learning method based on re-basin is also shown for several common benchmark datasets, providing experimental results that are competitive with state-of-art results from the literature.

下载PDF全文

下载文献需遵守相关版权规定

论文标题