论文标题

与动作误解的可及性游戏中欺骗性策略的合成

Synthesis of Deceptive Strategies in Reachability Games with Action Misperception

论文作者

Kulkarni, Abhishek N., Fu, Jie

论文摘要

我们考虑具有可及性目标(称为可触及性游戏)的图表上的一类基于两人转弯的零和游戏,其中玩家1(P1)的目标是达到一组目标状态,而玩家2(P2)的目标是防止这一点。特别是,我们考虑了玩家对彼此的动作功能的不对称信息的情况:P2以不完整的信息(误解)开始有关P1的动作集,并在P1使用以前不知道P2的动作时更新误解。当P1意识到P2的误解时,关键问题是P1是否可以控制P2的感知以欺骗P2以选择P1的优势行动?我们表明,P1可能存在一种欺骗性的获胜策略,可确保在国家的对称和完整的情况下,从一个州否则损失的国家可能会损失P1的目标。我们提出三个关键结果:首先,我们引入了动态的超级游戏模型,以捕获P2的不断发展的误解。其次,我们提出了一种固定点算法,以计算欺骗性的几乎持续的获胜(DASW)和DASW策略。最后,我们表明,DASW策略至少与游戏中P1的差异(ASW)策略一样强大,其中P1无法解决P2的误解。我们使用在对抗环境中使用机器人运动计划来说明我们的算法。

We consider a class of two-player turn-based zero-sum games on graphs with reachability objectives, known as reachability games, where the objective of Player 1 (P1) is to reach a set of goal states, and that of Player 2 (P2) is to prevent this. In particular, we consider the case where the players have asymmetric information about each other's action capabilities: P2 starts with an incomplete information (misperception) about P1's action set, and updates the misperception when P1 uses an action previously unknown to P2. When P1 is made aware of P2's misperception, the key question is whether P1 can control P2's perception so as to deceive P2 into selecting actions to P1's advantage? We show that there might exist a deceptive winning strategy for P1 that ensures P1's objective is achieved with probability one from a state otherwise losing for P1, had the information being symmetric and complete. We present three key results: First, we introduce a dynamic hypergame model to capture the reachability game with evolving misperception of P2. Second, we present a fixed-point algorithm to compute the Deceptive Almost-Sure Winning (DASW) region and DASW strategy. Finally, we show that DASW strategy is at least as powerful as Almost-Sure Winning (ASW) strategy in the game in which P1 does not account for P2's misperception. We illustrate our algorithm using a robot motion planning in an adversarial environment.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源