Recursive Regret Matching: A General Method for Solving Time-invariant   Nonlinear Zero-sum Differential Games

Wei Liao; Xiaohui Wei; Jizhou Lai

arXiv:2011.06209·eess.SY·November 13, 2020

Recursive Regret Matching: A General Method for Solving Time-invariant Nonlinear Zero-sum Differential Games

Wei Liao, Xiaohui Wei, Jizhou Lai

PDF

Open Access

TL;DR

This paper introduces Recursive Regret Matching, a novel approach for computing Nash equilibria in time-invariant nonlinear zero-sum differential games, capable of handling cases with or without saddle points.

Contribution

The method transforms differential games into a sequence of normal-form games solved via regret matching, improving real-time computation and avoiding saddle point existence analysis.

Findings

01

Successfully computes Nash equilibria in various game scenarios.

02

Handles cases where saddle points do not exist.

03

Demonstrates effectiveness through illustrative examples.

Abstract

In this paper, a new method is proposed to compute the rolling Nash equilibrium of the time-invariant nonlinear two-person zero-sum differential games. The idea is to discretize the time to transform a differential game into a sequential game with several steps, and by introducing state-value function, transform the sequential game into a recursion consisting of several normal-form games, finally, each normal-form game is solved with action abstraction and regret matching. To improve the real-time property of the proposed method, the state-value function can be kept in memory. This method can deal with the situations that the saddle point exists or does not exist, and the analysises of the existence of the saddle point can be avoided. If the saddle point does not exist, the mixed optimal control pair can be obtained. At the end of this paper, some examples are taken to illustrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Optimization and Variational Analysis