论文标题

有罪厌恶的心理理论有助于合作加强学习

Theory of Mind with Guilt Aversion Facilitates Cooperative Reinforcement Learning

论文作者

Nguyen, Dung, Venkatesh, Svetha, Nguyen, Phuoc, Tran, Truyen

论文摘要

如果他们认为他们失望的人失望,这会导致人们对公用事业损失的经历,这促进了人类的合作行为。在心理游戏理论中,内gui感厌恶需要对具有其他代理商认为的理论(也称为心理理论)的理论进行建模(汤姆)。我们旨在建立一种新型的情感加强学习者,称为“心灵厌恶理论”(Tomaga),它具有思考他人的福祉而不是自我利益的能力。为了验证代理设计,我们使用称为雄鹿狩猎的通用游戏作为测试床。作为标准的强化学习者可以学习社会困境等社会困境中的次优政策,我们建议将基于信念的内厌恶作为奖励塑造机制。我们表明,我们基于信念的罪恶感代理人可以在雄鹿狩猎游戏中有效学习合作行为。

Guilt aversion induces experience of a utility loss in people if they believe they have disappointed others, and this promotes cooperative behaviour in human. In psychological game theory, guilt aversion necessitates modelling of agents that have theory about what other agents think, also known as Theory of Mind (ToM). We aim to build a new kind of affective reinforcement learning agents, called Theory of Mind Agents with Guilt Aversion (ToMAGA), which are equipped with an ability to think about the wellbeing of others instead of just self-interest. To validate the agent design, we use a general-sum game known as Stag Hunt as a test bed. As standard reinforcement learning agents could learn suboptimal policies in social dilemmas like Stag Hunt, we propose to use belief-based guilt aversion as a reward shaping mechanism. We show that our belief-based guilt averse agents can efficiently learn cooperative behaviours in Stag Hunt Games.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源