Specialized Deep Residual Policy Safe Reinforcement Learning-Based Controller for Complex and Continuous State-Action Spaces
Ammar N. Abbas, Georgios C. Chasparis, and John D. Kelleher

TL;DR
This paper introduces a specialized deep residual policy reinforcement learning approach that combines traditional controllers with learning-based methods to safely and efficiently operate in complex, continuous environments, demonstrated on process control.
Contribution
It presents a novel hybrid control framework integrating residual policy learning with input-output hidden Markov models for safety and efficiency in continuous spaces.
Findings
Effective in complex process control scenarios
Enhances safety by focusing exploration around expert trajectories
Improves policy optimization in abnormal regions
Abstract
Traditional controllers have limitations as they rely on prior knowledge about the physics of the problem, require modeling of dynamics, and struggle to adapt to abnormal situations. Deep reinforcement learning has the potential to address these problems by learning optimal control policies through exploration in an environment. For safety-critical environments, it is impractical to explore randomly, and replacing conventional controllers with black-box models is also undesirable. Also, it is expensive in continuous state and action spaces, unless the search space is constrained. To address these challenges we propose a specialized deep residual policy safe reinforcement learning with a cycle of learning approach adapted for complex and continuous state-action spaces. Residual policy learning allows learning a hybrid control architecture where the reinforcement learning agent acts in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Reinforcement Learning in Robotics · Smart Grid Energy Management
