A PAC-Bayesian Framework for Optimal Control with Stability Guarantees
Mahrokh Ghoddousi Boroujeni, Clara Luc\'ia Galimberti, Andreas Krause,, Giancarlo Ferrari-Trecate

TL;DR
This paper introduces a PAC-Bayesian framework for stochastic nonlinear optimal control that provides generalization guarantees and stability, improving controller design especially with limited data.
Contribution
It develops a novel PAC-Bayesian approach to bound true control costs, enabling stable, data-efficient controller synthesis incorporating prior knowledge.
Findings
Provides rigorous generalization bounds for SNOC policies.
Designs neural network controllers with stability guarantees.
Demonstrates improved control performance in cooperative robotics.
Abstract
Stochastic Nonlinear Optimal Control (SNOC) involves minimizing a cost function that averages out the random uncertainties affecting the dynamics of nonlinear systems. For tractability reasons, this problem is typically addressed by minimizing an empirical cost, which represents the average cost across a finite dataset of sampled disturbances. However, this approach raises the challenge of quantifying the control performance against out-of-sample uncertainties. Particularly, in scenarios where the training dataset is small, SNOC policies are prone to overfitting, resulting in significant discrepancies between the empirical cost and the true cost, i.e., the average SNOC cost incurred during control deployment. Therefore, establishing generalization bounds on the true cost is crucial for ensuring reliability in real-world applications. In this paper, we introduce a novel approach that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Reservoir Engineering and Simulation Methods · Fault Detection and Control Systems
