Distributionally Robust Regret Optimal LQR with Common Stage-Law Ambiguity
Lukas-Benedikt Fiechtner, Jose Blanchet

TL;DR
This paper introduces a tractable distributionally robust regret optimization framework for finite-horizon LQR with common stage-law ambiguity, providing a semidefinite programming reformulation and practical insights.
Contribution
It develops the first tractable multistage DRRO formulation for stochastic control with stage-law ambiguity, extending LQR control with explicit regret guarantees.
Findings
DRRO often less conservative than DRO under same ambiguity set
Optimal controller combines nominal LQR law with a causal empirical-mean correction
Numerical results confirm empirical correction coefficients approach certainty-equivalent law
Abstract
We study, to our knowledge, the first tractable multistage ex-ante distributionally robust regret optimization (DRRO) formulation for stochastic control. We consider finite-horizon LQR under common stage-law ambiguity: disturbances are independent across time but share an unknown stage law whose mean and covariance lie in a Gelbrich ball around nominal parameters. Unlike the single-stage quadratic case, the nominal certainty-equivalent (CE) controller is generally not regret-optimal, because reuse of the stage law makes past disturbances informative for future decisions. Despite the general NP-hardness of DRRO, we show that over linear disturbance-feedback policies the resulting multistage DRRO-LQR problem admits an exact semidefinite programming reformulation. The optimal controller is the nominal certainty-equivalent LQR law plus a strictly causal empirical-mean correction. We also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
