Stability-Preserving Automatic Tuning of PID Control with Reinforcement Learning
Ayub I. Lakhani, Myisha A. Chowdhury, Qiugang Lu

TL;DR
This paper introduces a reinforcement learning-based framework for automatic PID tuning that guarantees closed-loop stability during the tuning process, using a novel episodic approach and stability supervision.
Contribution
It presents a stability-preserving RL tuning method with an episodic framework, baseline initialization, and a supervisor to prevent instability during PID parameter optimization.
Findings
Maintains closed-loop stability during RL-based PID tuning.
Converges quickly to optimal PID parameters.
Validated on a second-order plus dead-time system.
Abstract
PID control has been the dominant control strategy in the process industry due to its simplicity in design and effectiveness in controlling a wide range of processes. However, traditional methods on PID tuning often require extensive domain knowledge and field experience. To address the issue, this work proposes an automatic PID tuning framework based on reinforcement learning (RL), particularly the deterministic policy gradient (DPG) method. Different from existing studies on using RL for PID tuning, in this work, we consider the closed-loop stability throughout the RL-based tuning process. In particular, we propose a novel episodic tuning framework that allows for an episodic closed-loop operation under selected PID parameters where the actor and critic networks are updated once at the end of each episode. To ensure the closed-loop stability during the tuning, we initialize the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExtremum Seeking Control Systems · Advanced Control Systems Optimization · Viral Infectious Diseases and Gene Expression in Insects
MethodsLayer Normalization
