Forward-Backward Dynamic Programming for LQG Dynamic Games with Partial and Asymmetric Information

Yuxiang Guan; Iman Shames; Tyler Summers

arXiv:2603.18304·math.OC·March 20, 2026

Forward-Backward Dynamic Programming for LQG Dynamic Games with Partial and Asymmetric Information

Yuxiang Guan, Iman Shames, Tyler Summers

PDF

Open Access

TL;DR

This paper introduces a novel iterative forward-backward algorithm for solving two-player zero-sum stochastic dynamic games with partial and asymmetric information within an LQG framework, addressing belief representation and strategy computation.

Contribution

It develops a new computational approach combining forward-backward and value iteration algorithms for belief and strategy computation in asymmetric information games.

Findings

01

Algorithms effectively compute equilibrium strategies and belief states.

02

Numerical experiments demonstrate the algorithms' practical effectiveness.

03

Open-source implementation is provided for reproducibility.

Abstract

We formulate and study a class of two-player zero-sum stochastic dynamic games with partial and asymmetric information. Information asymmetry introduces fundamental challenges involving \emph{belief representation} and \emph{theory of mind} issues, where agents must impute belief states and estimates of other agents to inform their own strategy. To avoid an infinite regress of higher-order beliefs amongst agents and obtain computationally implementable results, we focus on a linear quadratic Gaussian (LQG) model and consider strategies with limited internal state dimension. We present a novel iterative forward-backward algorithm to jointly compute belief states and equilibrium strategies and value functions for a finite-horizon problem. We also present a value iteration-like algorithm to jointly compute stationary belief states and equilibrium strategies for an average-cost…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Game Theory and Applications