论文标题

通过多层次通信的多代理协调

Multi-Agent Coordination via Multi-Level Communication

论文作者

Ding, Ziluo, Liu, Zeyuan, Fang, Zhirui, Su, Kefan, Zhu, Liwen, Lu, Zongqing

论文摘要

可以通过通信访问有关他人的更多信息来减轻多代理设置中的部分可观察性和随机性。但是,由于循环依赖性无法同时交流实际的行动,因此仍然存在协调问题。在本文中,我们提出了一种新型的多层次通信方案,顺序通信(SEQCOMM)。 Seqcomm不同步(高级代理在低级阶段之前做出决定),并有两个通信阶段。在谈判阶段,代理通过传达观测的隐藏状态并比较通过对环境动态进行建模获得的意图价值来确定决策的优先级。在发射阶段,高级代理商领导了做出决策,然后与低级代理商进行交流。从理论上讲,我们证明了Seqcomm所学的政策可以单调改善并融合。从经验上讲,我们表明SEQCOMM在各种合作多代理任务中的现有方法优于现有方法。

The partial observability and stochasticity in multi-agent settings can be mitigated by accessing more information about others via communication. However, the coordination problem still exists since agents cannot communicate actual actions with each other at the same time due to the circular dependencies. In this paper, we propose a novel multi-level communication scheme, Sequential Communication (SeqComm). SeqComm treats agents asynchronously (the upper-level agents make decisions before the lower-level ones) and has two communication phases. In the negotiation phase, agents determine the priority of decision-making by communicating hidden states of observations and comparing the value of intention, obtained by modeling the environment dynamics. In the launching phase, the upper-level agents take the lead in making decisions and then communicate their actions with the lower-level agents. Theoretically, we prove the policies learned by SeqComm are guaranteed to improve monotonically and converge. Empirically, we show that SeqComm outperforms existing methods in various cooperative multi-agent tasks.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源