论文标题

Jacobi-Type Newton用于NASH平衡问题的下降保证

A Jacobi-type Newton method for Nash equilibrium problems with descent guarantees

论文作者

Kolossoski, Oliver, Bueno, Luís Felipe, Haeser, Gabriel

论文摘要

与连续变量解决不受限制的两人NASH平衡问题的常见策略是将牛顿的方法应用于通过相应的一阶必需最佳条件获得的非线性方程系统。但是,考虑到游戏动态时,尚不清楚每个玩家在考虑牛顿迭代后正在做出当前决定时的目标。在本文中,我们为牛顿的迭代提供了一种解释,鉴于游戏的动态:而不是最大程度地减少其目标函数的二次近似值,而不是由其他玩家当前的当前决策参数(作为典型的jacobi-type策略),而是我们表明,牛顿迭代会遵循这种方法,而是遵循了目标效应,而是通过对其他策略进行了对其他策略的参数来遵循的策略。这种解释使我们能够提出一种新的牛顿算法,其中引入了回溯过程,以确保每个播放器计算的牛顿方向是其相应参数化功能的下降方向。因此,除了偏爱全局融合外,我们的算法还有利于真正的最小化器,而不是最大化器或马鞍点,这与标准牛顿方法不同,标准牛顿方法不认为在非凸案例中问题的最小化结构。因此,与其他Jacobi型策略或纯牛顿方法相比,我们的方法更强大,这是通过我们的说明性数值实验证实的。我们还提供了在某些标准假设下的算法明确性明确的证明,以及对游戏动力学考虑的融合属性的彻底分析。

A common strategy for solving an unconstrained two-player Nash equilibrium problem with continuous variables is applying Newton's method to the system of nonlinear equations obtained by the corresponding first-order necessary optimality conditions. However, when taking into account the game dynamics, it is not clear what is the goal of each player when considering that they are taking their current decision following Newton's iterates. In this paper we provide an interpretation for Newton's iterate in view of the game dynamics as follows: instead of minimizing the quadratic approximation of their objective function parameterized by the other player current decision (as a typical Jacobi-type strategy), we show that the Newton iterate follows this approach but with the objective function parameterized by a prediction of the other player action, considering that they are following the same Newtonian strategy. This interpretation allows us to present a new Newtonian algorithm where a backtracking procedure is introduced in order to guarantee that the computed Newtonian directions, for each player, are descent directions for their corresponding parameterized functions. Thus, besides favoring global convergence, our algorithm also favors true minimizers instead of maximizers or saddle points, differently from the standard Newton method, which does not consider the minimization structure of the problem in the non-convex case. Thus, our method is more robust in comparison with other Jacobi-type strategies or the pure Newtonian approach, which is corroborated by our illustrative numerical experiments. We also present a proof of the well-definiteness of the algorithm under some standard assumptions, together with a thorough analysis of its convergence properties taking into account the game dynamics.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源