Risk-Aware Safe Reinforcement Learning for Control of Stochastic Linear Systems
Babak Esmaeili, Nariman Niknejad, Hamidreza Modares

TL;DR
This paper introduces a risk-aware safe reinforcement learning framework for stochastic linear systems that combines a safe controller with an RL controller, ensuring high-confidence safety and efficient control without relying on detailed system models.
Contribution
It proposes a novel method to learn safe controllers alongside RL controllers, using data-driven optimization and interpolation techniques to enhance safety and reduce data requirements.
Findings
High-confidence safety certification without high-fidelity models
Avoidance of myopic interventions and undesired equilibria
Reduced data needs and variance in safety violations
Abstract
This paper presents a risk-aware safe reinforcement learning (RL) control design for stochastic discrete-time linear systems. Rather than using a safety certifier to myopically intervene with the RL controller, a risk-informed safe controller is also learned besides the RL controller, and the RL and safe controllers are combined together. Several advantages come along with this approach: 1) High-confidence safety can be certified without relying on a high-fidelity system model and using limited data available, 2) Myopic interventions and convergence to an undesired equilibrium can be avoided by deciding on the contribution of two stabilizing controllers, and 3) highly efficient and computationally tractable solutions can be provided by optimizing over a scalar decision variable and linear programming polyhedral sets. To learn safe controllers with a large invariant set, piecewise affine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExtremum Seeking Control Systems · Fault Detection and Control Systems · Traffic control and management
