Adaptive Distributionally Robust Optimal Control with Bayesian Ambiguity Sets
Wentao Ma, Zhiping Chen, Huifu Xu, Enlu Zhou

TL;DR
This paper introduces an adaptive distributionally robust optimal control model that updates ambiguity sets via Bayesian learning, providing a less conservative, data-efficient approach with theoretical guarantees and practical effectiveness demonstrated through inventory control experiments.
Contribution
It develops a novel Bayesian-based adaptive DROC framework with a tractable reformulation, convergence guarantees, and a new algorithm for episodic data scenarios.
Findings
The model achieves robust performance with limited data.
The BOCP algorithm converges efficiently and reliably.
Numerical results show improved adaptivity and robustness.
Abstract
In stochastic optimal control (SOC), uncertainty may arise from incomplete knowledge of the true probability distribution of the underlying environment, which is known as Knightian or epistemic uncertainty. Distributionally robust optimal control (DROC) models are subsequently proposed to tackle this source of uncertainty. While such models are effective in some practical applications, most existing DROC models are offline and can be overly conservative when data are scarce. Moreover, they cannot be applied to the case when samples are generated episodically. Motivated by the Bayesian SOC framework recently proposed by Shapiro et al.~\cite{shapiro2025episodic}, we propose an adaptive DROC model in which the ambiguity set is updated via Bayesian learning from new data. Under some moderate conditions, we derive a tractable risk-averse reformulation, establish consistency of the optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
