Optimal control of batch processes via a deterministic Q-learning method

Abdelrahman ElMezain; Mohamed Saleh; Jie Zhang; Ahmed Soliman; Seif; Fateen

arXiv:1904.03654·cs.SY·April 16, 2019

Optimal control of batch processes via a deterministic Q-learning method

Abdelrahman ElMezain, Mohamed Saleh, Jie Zhang, Ahmed Soliman, Seif, Fateen

PDF

Open Access

TL;DR

This paper introduces a deterministic Q-learning approach for robust online control of batch reactors, enabling adaptation to unplanned changes and improving process optimization over traditional time-based methods.

Contribution

It presents a novel deterministic Q-learning method tailored for real-time optimization of batch processes, enhancing robustness against system disturbances.

Findings

01

Q-learning outperforms traditional methods in handling unplanned process changes

02

The proposed method achieves higher final product yields

03

It effectively minimizes undesired side products

Abstract

Dynamic optimization of nonlinear chemical systems -- such as batch reactors -- should be applied online, and the suitable control taken should be according to the current state of the system rather than the current time instant. The recent state of the art methods applies the control based on the current time instant only. This is not suitable for most cases, as it is not robust to possible changes in the system. This paper proposes a Deterministic Q-Learning method to conduct robust online optimization of batch reactors. In this paper, the Q-Learning method is applied on simple batch reactor models; and in order to show the effectiveness of the proposed method the results are compared to other dynamic optimization methods. The main advantage of the Q-learning method or the proposed method is that it can accommodate unplanned changes during the process via changing the control action;…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Advanced Control Systems Optimization · Iterative Learning Control Systems