Safe Chance Constrained Reinforcement Learning for Batch Process Control
Max Mowbray, Panagiotis Petsagkourakis, Ehecatl Antonio del R\'io, Chanona, Dongda Zhang

TL;DR
This paper introduces a data-driven safe reinforcement learning approach using Gaussian processes to handle plant-model mismatch and ensure joint chance constraint satisfaction in batch process control.
Contribution
It extends existing safe RL methods by incorporating Gaussian process-based uncertainty to address plant-model mismatch in process control.
Findings
Successfully accounts for process uncertainty and plant-model mismatch
Enables satisfaction of joint chance constraints in case studies
Outperforms traditional nonlinear model predictive control
Abstract
Reinforcement Learning (RL) controllers have generated excitement within the control community. The primary advantage of RL controllers relative to existing methods is their ability to optimize uncertain systems independently of explicit assumption of process uncertainty. Recent focus on engineering applications has been directed towards the development of safe RL controllers. Previous works have proposed approaches to account for constraint satisfaction through constraint tightening from the domain of stochastic model predictive control. Here, we extend these approaches to account for plant-model mismatch. Specifically, we propose a data-driven approach that utilizes Gaussian processes for the offline simulation model and use the associated posterior uncertainty prediction to account for joint chance constraints and plant-model mismatch. The method is benchmarked against nonlinear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Fault Detection and Control Systems · Process Optimization and Integration
