Safe Model-Based Reinforcement Learning for Systems with Parametric   Uncertainties

S M Nahid Mahmud; Scott A Nivison; Zachary I. Bell; Rushikesh; Kamalapurkar

arXiv:2007.12666·eess.SY·October 6, 2021

Safe Model-Based Reinforcement Learning for Systems with Parametric Uncertainties

S M Nahid Mahmud, Scott A Nivison, Zachary I. Bell, Rushikesh, Kamalapurkar

PDF

Open Access

TL;DR

This paper introduces a safe model-based reinforcement learning method for deterministic nonlinear systems with parametric uncertainties, enabling learning of constrained optimal policies without strict excitation requirements.

Contribution

It develops a novel filtered concurrent learning approach combined with barrier transformation to learn unknown parameters and control policies simultaneously.

Findings

01

Successfully learns approximate constrained optimal policies.

02

Handles parametric uncertainties without strict excitation.

03

Ensures safety in safety-critical systems.

Abstract

Reinforcement learning has been established over the past decade as an effective tool to find optimal control policies for dynamical systems, with recent focus on approaches that guarantee safety during the learning and/or execution phases. In general, safety guarantees are critical in reinforcement learning when the system is safety-critical and/or task restarts are not practically feasible. In optimal control theory, safety requirements are often expressed in terms of state and/or control constraints. In recent years, reinforcement learning approaches that rely on persistent excitation have been combined with a barrier transformation to learn the optimal control policies under state constraints. To soften the excitation requirements, model-based reinforcement learning methods that rely on exact model knowledge have also been integrated with the barrier transformation framework. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMechanical Circulatory Support Devices · Fuel Cells and Related Materials