Stagewise Newton Method for Dynamic Game Control with Imperfect State   Observation

Armand Jordana; Bilal Hammoud; Justin Carpentier; Ludovic Righetti

arXiv:2206.11341·math.OC·June 24, 2022·IEEE Control. Syst. Lett.

Stagewise Newton Method for Dynamic Game Control with Imperfect State Observation

Armand Jordana, Bilal Hammoud, Justin Carpentier, Ludovic Righetti

PDF

1 Repo

TL;DR

This paper introduces a low-complexity, iterative Newton-based method for dynamic game control under imperfect state observations, enabling risk-sensitive decision-making with guaranteed convergence.

Contribution

The paper presents a novel, efficient algorithm combining backward and forward recursions for local Nash equilibrium computation in imperfect information settings.

Findings

01

Algorithm has linear complexity in time horizon.

02

Guarantees convergence via merit function and line search.

03

Demonstrates risk-sensitive control in robotic simulations.

Abstract

In this letter, we study dynamic game optimal control with imperfect state observations and introduce an iterative method to find a local Nash equilibrium. The algorithm consists of an iterative procedure combining a backward recursion similar to minimax differential dynamic programming and a forward recursion resembling a risk-sensitive Kalman smoother. A coupling equation renders the resulting control dependent on the estimation. In the end, the algorithm is equivalent to a Newton step but has linear complexity in the time horizon length. Furthermore, a merit function and a line search procedure are introduced to guarantee convergence of the iterative scheme. The resulting controller reasons about uncertainty by planning for the worst case disturbances. Lastly, the low computational cost of the proposed algorithm makes it a promising method to do output-feedback model predictive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

machines-in-motion/dynamic_game_optimizer
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.