Safe Controller for Output Feedback Linear Systems using Model-Based Reinforcement Learning

S M Nahid Mahmud; Moad Abudia; Scott A Nivison; Zachary I. Bell; Rushikesh Kamalapurkar

arXiv:2204.01409·eess.SY·April 5, 2022

Open Access

TL;DR

This paper introduces a novel output-feedback safe reinforcement learning method for linear systems, enabling safe policy learning without full state feedback, demonstrated through simulation results.

Contribution

It presents a barrier-aware dynamic state estimator that allows safe reinforcement learning using output feedback, expanding applicability in real-world safety-critical systems.

Findings

01. Barrier transformation effectively enables online reinforcement learning.

02. The proposed method ensures safety during learning in simulation.

03. Output feedback suffices for safe control policy development.

Abstract

The objective of this research is to enable safety-critical systems to simultaneously learn and execute optimal control policies in a safe manner to achieve complex autonomy. Learning optimal policies via trial and error, i.e., traditional reinforcement learning, is difficult to implement in safety-critical systems, particularly when task restarts are unavailable. Safe model-based reinforcement learning techniques based on a barrier transformation have recently been developed to address this problem. However, these methods rely on full state feedback, limiting their usability in a real-world environment. In this work, an output-feedback safe model-based reinforcement learning technique based on a novel barrier-aware dynamic state estimator has been designed to address this issue. The developed approach facilitates simultaneous learning and execution of safe control policies for…
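The abstract names a barrier transformation but this page does not spell it out. As a minimal sketch, assuming the logarithmic barrier function commonly used in the barrier-transformation literature, b(x) = log(A(a - x) / (a(A - x))) with a < 0 < A, the following Python maps a constrained coordinate in (a, A) to an unconstrained one and back. The function names and the bounds used are illustrative assumptions, not taken from the paper.

```python
import math


def barrier(x: float, a: float, A: float) -> float:
    """Map x in the open interval (a, A), with a < 0 < A, onto the real line.

    Assumed form: b(x) = log(A * (a - x) / (a * (A - x))).
    b(0) = 0, and b(x) diverges as x approaches either bound.
    """
    if not (a < 0.0 < A):
        raise ValueError("expected a < 0 < A")
    if not (a < x < A):
        raise ValueError("x must lie strictly inside (a, A)")
    return math.log((A * (a - x)) / (a * (A - x)))


def barrier_inverse(s: float, a: float, A: float) -> float:
    """Map an unconstrained value s back into (a, A).

    Obtained by solving s = b(x) for x:
    x = a * A * (exp(s) - 1) / (a * exp(s) - A).
    """
    if not (a < 0.0 < A):
        raise ValueError("expected a < 0 < A")
    e = math.exp(s)
    # The denominator is strictly negative because a < 0 < A and e > 0.
    return a * A * (e - 1.0) / (a * e - A)


if __name__ == "__main__":
    a, A = -1.0, 2.0  # hypothetical safety bounds on one coordinate
    for x in (-0.9, 0.0, 0.5, 1.9):
        s = barrier(x, a, A)
        print(f"x={x:+.2f} -> s={s:+.4f} -> x={barrier_inverse(s, a, A):+.4f}")
```

The intuition is that the transformed coordinate grows without bound as x approaches a or A, so any bounded trajectory in the transformed coordinate corresponds to a trajectory that stays inside the interval. This illustrates the transformation only, not the barrier-aware state estimator the paper develops.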

Taxonomy

Topics: Software Reliability and Analysis Research · Reinforcement Learning in Robotics · Smart Grid Security and Resilience