DIFMAML：分散的多代理元学习

论文标题

DIFMAML：分散的多代理元学习

Dif-MAML: Decentralized Multi-Agent Meta-Learning

论文作者

Kayaalp, Mert, Vlaski, Stefan, Sayed, Ali H.

论文摘要

元学习的目的是利用从观察到的任务中获得的知识，以改善对看不见的任务的适应性。因此，元学习者在接受大量观察到的任务和每个任务的大量数据培训时能够更好地概括。考虑到所需的资源量，通常很难期望任务，它们各自的数据以及在单个中央位置可用的必要计算能力。遇到这些资源分布在通过某些图形拓扑连接的几个代理之间的情况更为自然。元学习的形式主义实际上非常适合这种分散的环境，在该环境中，学习者将能够从遍布代理商的信息和计算能力中受益。在这项工作中，在这项工作中，我们提出了一种合作的完全截然不同的多代理元学习算法，称为基于扩散的MAML或DIFMAML。就可伸缩性，避免通信瓶颈和隐私保证而言，分散的优化算法优于集中实现。这项工作提供了详细的理论分析，以表明所提出的策略允许一组代理以线性速率达成协议，即使在非convex环境中，也可以将MAML目标的固定点收集到固定点。模拟结果说明了相对于传统的非合作环境的理论发现和出色的性能。

The objective of meta-learning is to exploit the knowledge obtained from observed tasks to improve adaptation to unseen tasks. As such, meta-learners are able to generalize better when they are trained with a larger number of observed tasks and with a larger amount of data per task. Given the amount of resources that are needed, it is generally difficult to expect the tasks, their respective data, and the necessary computational capacity to be available at a single central location. It is more natural to encounter situations where these resources are spread across several agents connected by some graph topology. The formalism of meta-learning is actually well-suited to this decentralized setting, where the learner would be able to benefit from information and computational power spread across the agents. Motivated by this observation, in this work, we propose a cooperative fully-decentralized multi-agent meta-learning algorithm, referred to as Diffusion-based MAML or Dif-MAML. Decentralized optimization algorithms are superior to centralized implementations in terms of scalability, avoidance of communication bottlenecks, and privacy guarantees. The work provides a detailed theoretical analysis to show that the proposed strategy allows a collection of agents to attain agreement at a linear rate and to converge to a stationary point of the aggregate MAML objective even in non-convex environments. Simulation results illustrate the theoretical findings and the superior performance relative to the traditional non-cooperative setting.

下载PDF全文

下载文献需遵守相关版权规定

论文标题