Safe Chance Constrained Reinforcement Learning for Batch Process Control

Max Mowbray; Panagiotis Petsagkourakis; Ehecatl Antonio del R\'io; Chanona; Dongda Zhang

arXiv:2104.11706·cs.LG·December 7, 2021

Safe Chance Constrained Reinforcement Learning for Batch Process Control

Max Mowbray, Panagiotis Petsagkourakis, Ehecatl Antonio del R\'io, Chanona, Dongda Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a data-driven safe reinforcement learning approach using Gaussian processes to handle plant-model mismatch and ensure joint chance constraint satisfaction in batch process control.

Contribution

It extends existing safe RL methods by incorporating Gaussian process-based uncertainty to address plant-model mismatch in process control.

Findings

01

Successfully accounts for process uncertainty and plant-model mismatch

02

Enables satisfaction of joint chance constraints in case studies

03

Outperforms traditional nonlinear model predictive control

Abstract

Reinforcement Learning (RL) controllers have generated excitement within the control community. The primary advantage of RL controllers relative to existing methods is their ability to optimize uncertain systems independently of explicit assumption of process uncertainty. Recent focus on engineering applications has been directed towards the development of safe RL controllers. Previous works have proposed approaches to account for constraint satisfaction through constraint tightening from the domain of stochastic model predictive control. Here, we extend these approaches to account for plant-model mismatch. Specifically, we propose a data-driven approach that utilizes Gaussian processes for the offline simulation model and use the associated posterior uncertainty prediction to account for joint chance constraints and plant-model mismatch. The method is benchmarked against nonlinear…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mawbray/Lutein-Dataset
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Fault Detection and Control Systems · Process Optimization and Integration