One-shot, Offline and Production-Scalable PID Optimisation with Deep Reinforcement Learning
Zacharaya Shabka, Michael Enrico, Nick Parsons, Georgios Zervas

TL;DR
This paper introduces a deep reinforcement learning method for PID parameter optimization that is scalable, fast, and effective, significantly improving control performance in industrial optical switching applications with minimal online overhead.
Contribution
The paper presents a novel deep RL approach for offline, production-scalable PID tuning that drastically reduces tuning time and enhances control performance in industrial settings.
Findings
Achieved a 5x increase in actuators meeting challenging switching speed targets.
Realized a 20% improvement in mean switching speed at constant optical loss.
Reduced performance inconsistency by over 75% across temperature variations.
Abstract
Proportional-integral-derivative (PID) control underlies more than of automated industrial processes. Controlling these processes effectively with respect to some specified set of performance goals requires finding an optimal set of PID parameters to moderate the PID loop. Tuning these parameters is a long and exhaustive process. A method (patent pending) based on deep reinforcement learning is presented that learns a relationship between generic system properties (e.g. resonance frequency), a multi-objective performance goal and optimal PID parameter values. Performance is demonstrated in the context of a real optical switching product of the foremost manufacturer of such devices globally. Switching is handled by piezoelectric actuators where switching time and optical loss are derived from the speed and stability of actuator-control processes respectively. The method achieves a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIterative Learning Control Systems · Extremum Seeking Control Systems · Semiconductor Lasers and Optical Devices
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
