On the Design of Safe Continual RL Methods for Control of Nonlinear Systems

Austin Coursey; Marcos Quinones-Grueiro; Gautam Biswas

arXiv:2502.15922·cs.LG·February 25, 2025

TL;DR

This paper investigates the challenges of integrating safety constraints into continual reinforcement learning for nonlinear control systems, highlighting limitations of existing methods and proposing a reward-shaping approach to improve safety and task retention.

Contribution

The paper identifies safety and forgetting issues in existing safe RL and continual RL algorithms, and introduces a reward-shaping method to improve safety preservation in nonlinear, non-stationary systems.
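As a rough illustration of what reward shaping for safety can look like, the sketch below subtracts a weighted safety cost from the task reward. This is a generic cost-penalty formulation and is not taken from the paper; the names `shaped_reward`, `episode_return`, `penalty_coef`, and `discount` are hypothetical and chosen only for this example.

```python
from typing import Sequence


def shaped_reward(reward: float, cost: float, penalty_coef: float = 1.0) -> float:
    """Return the task reward minus a weighted safety cost.

    Assumption: `cost` is a non-negative per-step measure of constraint
    violation. The paper's actual shaping term may differ.
    """
    if cost < 0 or penalty_coef < 0:
        raise ValueError("cost and penalty_coef must be non-negative")
    return reward - penalty_coef * cost


def episode_return(
    rewards: Sequence[float],
    costs: Sequence[float],
    penalty_coef: float = 1.0,
    discount: float = 0.99,
) -> float:
    """Discounted return computed over the shaped per-step rewards."""
    if len(rewards) != len(costs):
        raise ValueError("rewards and costs must have the same length")
    total = 0.0
    for t, (r, c) in enumerate(zip(rewards, costs)):
        total += (discount ** t) * shaped_reward(r, c, penalty_coef)
    return total
```

A larger `penalty_coef` pushes the agent toward avoiding constraint violations at the expense of task reward, so in practice it would need tuning per task.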

Findings

01 Online elastic weight consolidation fails to ensure safety in nonlinear systems.

02 Constrained policy optimization suffers from catastrophic forgetting.

03 Reward shaping improves safety and task retention in continual RL.
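For context on the first finding, online elastic weight consolidation discourages forgetting by penalizing movement of parameters that were important for earlier tasks. The sketch below shows the standard quadratic penalty and a decayed running Fisher estimate in plain Python. It illustrates the general technique only and is not the paper's implementation; the function names and the `strength` and `decay` parameters are hypothetical.

```python
from typing import List, Sequence


def ewc_penalty(
    params: Sequence[float],
    anchor_params: Sequence[float],
    fisher: Sequence[float],
    strength: float = 1.0,
) -> float:
    """Quadratic penalty: (strength / 2) * sum_i F_i * (theta_i - anchor_i)^2.

    `anchor_params` are the parameters saved after the previous task and
    `fisher` is a diagonal Fisher information estimate (one value per parameter).
    """
    return 0.5 * strength * sum(
        f * (p - a) ** 2 for p, a, f in zip(params, anchor_params, fisher)
    )


def update_fisher(
    running_fisher: Sequence[float],
    new_fisher: Sequence[float],
    decay: float = 0.9,
) -> List[float]:
    """Online update: decay the running estimate, then add the new task's estimate."""
    return [decay * old + new for old, new in zip(running_fisher, new_fisher)]
```

The penalty would be added to the RL loss while training on a new task. Note that it constrains only parameter drift, and nothing in it refers to a safety cost.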

Abstract

Reinforcement learning (RL) algorithms have been successfully applied to control tasks associated with unmanned aerial vehicles and robotics. In recent years, safe RL has been proposed to allow the safe execution of RL algorithms in industrial and mission-critical systems that operate in closed loops. However, if the system operating conditions change, such as when an unknown fault occurs in the system, typical safe RL algorithms are unable to adapt while retaining past knowledge. Continual reinforcement learning algorithms have been proposed to address this issue. However, the impact of continual adaptation on the system's safety is an understudied problem. In this paper, we study the intersection of safe and continual RL. First, we empirically demonstrate that a popular continual RL algorithm, online elastic weight consolidation, is unable to satisfy safety constraints in non-linear…

Code & Models

Repositories

MACS-Research-Lab/safe-continual
PyTorch · Official

Taxonomy

Topics: Control Systems and Identification · Stability and Control of Uncertain Systems · Advanced Control Systems Design