Predictive Compensation in Finite-Horizon LQ Games under Gauss-Markov Deviations
Navid Mojahed, Mahdis Rabbani, Shima Nazari

TL;DR
This paper introduces a predictive compensation approach for finite-horizon linear quadratic games with Gauss-Markov deviations, enhancing robustness and reducing costs through anticipatory strategies.
Contribution
It develops a closed-form recursive framework for predictive compensation in dynamic games with correlated disturbances, extending Nash strategies to account for stochastic deviations.
Findings
Predictive compensation reduces expected costs in disturbed environments.
The framework provides boundedness and sensitivity analysis of the equilibrium.
Numerical results show performance improvements over traditional feedback strategies.
Abstract
This paper develops a predictive compensation framework for finite-horizon, discrete-time linear quadratic dynamic games subject to Gauss-Markov execution deviations from feedback Nash strategies. One player's control is corrupted by temporally correlated stochastic perturbations modeled as a first-order autoregressive (AR(1)) process, while the opposing player has causal access to past deviations and employs a predictive feedforward strategy that anticipates their future effect. We derive closed-form recursions for mean and covariance propagation under the resulting perturbed closed loop, establish boundedness and sensitivity properties of the equilibrium trajectory, and characterize the reduction in expected cost achieved by optimal predictive compensation. Numerical experiments corroborate the theoretical results and demonstrate performance gains relative to nominal Nash feedback…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Game Theory and Applications · Adaptive Dynamic Programming Control
