Linear-Quadratic Discrete-Time Dynamic Games with Unknown Dynamics
Shengyuan Huang, Xiaoguang Yang, Zhigang Cao, Wenjun Mei

TL;DR
This paper develops a data-driven framework for solving finite and infinite-horizon linear-quadratic discrete-time dynamic games with unknown dynamics, providing conditions for feedback Nash equilibria and algorithms for their computation.
Contribution
It introduces necessary and sufficient conditions for FNE existence using offline data and proposes algorithms for computing equilibria without known system models.
Findings
FNEs of unknown and known dynamics games are equivalent.
An invertibility condition simplifies FNE computation.
Finite-horizon strategies converge to infinite-horizon solutions.
Abstract
Considering linear-quadratic discrete-time games with unknown input/output/state (i/o/s) dynamics and state, we provide necessary and sufficient conditions for the existence and uniqueness of feedback Nash equilibria (FNE) in the finite-horizon game, based entirely on offline input/output data. We prove that the finite-horizon unknown-dynamics game and its corresponding known-dynamics game have the same FNEs, and provide detailed relationships between their respective FNE matrices. To simplify the computation of FNEs, we provide an invertibility condition and a corresponding algorithm that computes one FNE by solving a finite number of linear equation systems using offline data. For the infinite-horizon unknown-dynamics game, limited offline data restricts players to computing optimal strategies only over a finite horizon. We prove that the finite-horizon strategy ``watching steps…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Advanced Control Systems Optimization · Stability and Control of Uncertain Systems
