论文标题
平衡分析和控制决策人群
Equilibration Analysis and Control of Coordinating Decision-Making Populations
论文作者
论文摘要
一群决策者是否会达到令人满意的决策状态,这是研究集体行为的基本问题。在进化游戏理论和潜在功能的框架内,研究人员通过对代理的实用程序功能强加某些条件,在不同的更新规则(包括最佳响应和模仿)下建立了平衡收敛。然后,通过使用所提出的电势函数,他们能够将这些人群控制在某些所需的平衡方面。然而,即使不是不可能,找到潜在的功能通常令人生畏。我们介绍了所谓的协调代理,只有至少另一个代理这样做的时候才倾向于切换做出决定。我们证明,任何协调药的人口,人口协调,几乎可以肯定平衡。显然,某些被证明使用潜在功能平衡的二进制网络游戏正在协调,并且可以使用此概念来解决某些着色问题。我们还表明,在最佳响应,模仿或合理模仿之后的任何混合代理网络以及与协调收益矩阵相关联的混合网络都是协调的,因此,平衡。作为第二个贡献,我们提供了一种基于激励的控制算法,该算法将协调人群达到所需的平衡。该算法迭代地最大化了选择所需决定与提供的激励措施的代理数量的比率。它的性能几乎是最佳和最佳响应和模仿的专业算法;但是,它不需要潜在的功能。因此,在尚未发现给定决策人群尚未发现潜在功能的一般情况下,可以轻松地应用该控制算法。
Whether a population of decision-making individuals will reach a state of satisfactory decisions is a fundamental problem in studying collective behaviors. In the framework of evolutionary game theory and by means of potential functions, researchers have established equilibrium convergence under different update rules, including best-response and imitation, by imposing certain conditions on agents' utility functions. Then by using the proposed potential functions, they have been able to control these populations towards some desired equilibrium. Nevertheless, finding a potential function is often daunting, if not near impossible. We introduce the so-called coordinating agent who tends to switch to a decision only if at least another agent has done so. We prove that any population of coordinating agents, a coordinating population, almost surely equilibrates. Apparently, some binary network games that were proven to equilibrate using potential functions are coordinating, and some coloring problems can be solved using this notion. We additionally show that any mixed network of agents following best-response, imitation, or rational imitation, and associated with coordination payoff matrices is coordinating, and hence, equilibrates. As a second contribution, we provide an incentive-based control algorithm that leads coordinating populations to a desired equilibrium. The algorithm iteratively maximizes the ratio of the number of agents choosing the desired decision to the provided incentive. It performs near optimal and as well as specialized algorithms proposed for best-response and imitation; however, it does not require a potential function. Therefore, this control algorithm can be readily applied in general situations where no potential function is yet found for a given decision-making population.