One-shot, Offline and Production-Scalable PID Optimisation with Deep   Reinforcement Learning

Zacharaya Shabka; Michael Enrico; Nick Parsons; Georgios Zervas

arXiv:2210.13906·eess.SY·October 26, 2022

One-shot, Offline and Production-Scalable PID Optimisation with Deep Reinforcement Learning

Zacharaya Shabka, Michael Enrico, Nick Parsons, Georgios Zervas

PDF

Open Access

TL;DR

This paper introduces a deep reinforcement learning method for PID parameter optimization that is scalable, fast, and effective, significantly improving control performance in industrial optical switching applications with minimal online overhead.

Contribution

The paper presents a novel deep RL approach for offline, production-scalable PID tuning that drastically reduces tuning time and enhances control performance in industrial settings.

Findings

01

Achieved a 5x increase in actuators meeting challenging switching speed targets.

02

Realized a 20% improvement in mean switching speed at constant optical loss.

03

Reduced performance inconsistency by over 75% across temperature variations.

Abstract

Proportional-integral-derivative (PID) control underlies more than $97%$ of automated industrial processes. Controlling these processes effectively with respect to some specified set of performance goals requires finding an optimal set of PID parameters to moderate the PID loop. Tuning these parameters is a long and exhaustive process. A method (patent pending) based on deep reinforcement learning is presented that learns a relationship between generic system properties (e.g. resonance frequency), a multi-objective performance goal and optimal PID parameter values. Performance is demonstrated in the context of a real optical switching product of the foremost manufacturer of such devices globally. Switching is handled by piezoelectric actuators where switching time and optical loss are derived from the speed and stability of actuator-control processes respectively. The method achieves a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIterative Learning Control Systems · Extremum Seeking Control Systems · Semiconductor Lasers and Optical Devices

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings