Stability-Preserving Automatic Tuning of PID Control with Reinforcement   Learning

Ayub I. Lakhani; Myisha A. Chowdhury; Qiugang Lu

arXiv:2112.15187·eess.SY·February 14, 2022·1 cites

Stability-Preserving Automatic Tuning of PID Control with Reinforcement Learning

Ayub I. Lakhani, Myisha A. Chowdhury, Qiugang Lu

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning-based framework for automatic PID tuning that guarantees closed-loop stability during the tuning process, using a novel episodic approach and stability supervision.

Contribution

It presents a stability-preserving RL tuning method with an episodic framework, baseline initialization, and a supervisor to prevent instability during PID parameter optimization.

Findings

01

Maintains closed-loop stability during RL-based PID tuning.

02

Converges quickly to optimal PID parameters.

03

Validated on a second-order plus dead-time system.

Abstract

PID control has been the dominant control strategy in the process industry due to its simplicity in design and effectiveness in controlling a wide range of processes. However, traditional methods on PID tuning often require extensive domain knowledge and field experience. To address the issue, this work proposes an automatic PID tuning framework based on reinforcement learning (RL), particularly the deterministic policy gradient (DPG) method. Different from existing studies on using RL for PID tuning, in this work, we consider the closed-loop stability throughout the RL-based tuning process. In particular, we propose a novel episodic tuning framework that allows for an episodic closed-loop operation under selected PID parameters where the actor and critic networks are updated once at the end of each episode. To ensure the closed-loop stability during the tuning, we initialize the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExtremum Seeking Control Systems · Advanced Control Systems Optimization · Viral Infectious Diseases and Gene Expression in Insects

MethodsLayer Normalization