Specialized Deep Residual Policy Safe Reinforcement Learning-Based   Controller for Complex and Continuous State-Action Spaces

Ammar N. Abbas; Georgios C. Chasparis; and John D. Kelleher

arXiv:2310.14788·cs.LG·October 24, 2023·1 cites

Specialized Deep Residual Policy Safe Reinforcement Learning-Based Controller for Complex and Continuous State-Action Spaces

Ammar N. Abbas, Georgios C. Chasparis, and John D. Kelleher

PDF

Open Access 1 Repo

TL;DR

This paper introduces a specialized deep residual policy reinforcement learning approach that combines traditional controllers with learning-based methods to safely and efficiently operate in complex, continuous environments, demonstrated on process control.

Contribution

It presents a novel hybrid control framework integrating residual policy learning with input-output hidden Markov models for safety and efficiency in continuous spaces.

Findings

01

Effective in complex process control scenarios

02

Enhances safety by focusing exploration around expert trajectories

03

Improves policy optimization in abnormal regions

Abstract

Traditional controllers have limitations as they rely on prior knowledge about the physics of the problem, require modeling of dynamics, and struggle to adapt to abnormal situations. Deep reinforcement learning has the potential to address these problems by learning optimal control policies through exploration in an environment. For safety-critical environments, it is impractical to explore randomly, and replacing conventional controllers with black-box models is also undesirable. Also, it is expensive in continuous state and action spaces, unless the search space is constrained. To address these challenges we propose a specialized deep residual policy safe reinforcement learning with a cycle of learning approach adapted for complex and continuous state-action spaces. Residual policy learning allows learning a hybrid control architecture where the reinforcement learning agent acts in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ammar-n-abbas/CoL-SDRPRL
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Reinforcement Learning in Robotics · Smart Grid Energy Management