Constrained Model-Free Reinforcement Learning for Process Optimization

Elton Pan; Panagiotis Petsagkourakis; Max Mowbray; Dongda Zhang,; Antonio del Rio-Chanona

arXiv:2011.07925·cs.LG·April 15, 2021·5 cites

Constrained Model-Free Reinforcement Learning for Process Optimization

Elton Pan, Panagiotis Petsagkourakis, Max Mowbray, Dongda Zhang,, Antonio del Rio-Chanona

PDF

Open Access

TL;DR

This paper introduces a constrained Q-learning algorithm that guarantees high-probability satisfaction of safety constraints, advancing reinforcement learning's applicability to real-world process control.

Contribution

It proposes an oracle-assisted, self-tuning constraint tightening method integrated into RL to ensure safety constraints are met with high probability in process optimization.

Findings

01

Algorithm guarantees constraint satisfaction with high probability

02

Performance comparable or superior to model predictive control

03

Applicable to safety-critical industrial processes

Abstract

Reinforcement learning (RL) is a control approach that can handle nonlinear stochastic optimal control problems. However, despite the promise exhibited, RL has yet to see marked translation to industrial practice primarily due to its inability to satisfy state constraints. In this work we aim to address this challenge. We propose an 'oracle'-assisted constrained Q-learning algorithm that guarantees the satisfaction of joint chance constraints with a high probability, which is crucial for safety critical tasks. To achieve this, constraint tightening (backoffs) are introduced and adjusted using Broyden's method, hence making them self-tuned. This results in a general methodology that can be imbued into approximate dynamic programming-based algorithms to ensure constraint satisfaction with high probability. Finally, we present case studies that analyze the performance of the proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Reinforcement Learning in Robotics · Process Optimization and Integration

MethodsQ-Learning