A Satisficing Control Design Framework with Safety and Performance Guarantees for Constrained Systems under Disturbances
Yuzhen Han, Hamidreza Modares

TL;DR
This paper introduces a safe robust policy iteration algorithm that guarantees safety and satisfactory performance for constrained systems under disturbances, balancing safety constraints with performance goals.
Contribution
It develops a unified framework combining safety certification via control barrier functions with performance guarantees through Bellman inequalities, using a satisficing approach.
Findings
Guarantees robust safety and performance at each iteration.
Employs sum of squares programming for implementation.
Numerical simulations validate the framework.
Abstract
This paper presents a safe robust policy iteration (SR-PI) algorithm to design controllers with satisficing (good enough) performance and safety guarantee. This is in contrast to standard PI-based control design methods with no safety certification. It also moves away from existing safe control design approaches that perform pointwise optimization and are thus myopic. Safety assurance requires satisfying a control barrier function (CBF), which might be in conflict with the performance-driven Lyapunov solution to the Bellman equation arising in each iteration of the PI. Therefore, a new development is required to robustly certify the safety of an improved policy at each iteration of the PI. The proposed SR-PI algorithm unifies performance guarantee (provided by a Bellman inequality) with safety guarantee (provided by a robust CBF) at each iteration. The Bellman inequality resembles the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Adaptive Control of Nonlinear Systems · Reinforcement Learning in Robotics
