SPI-Optimizer: an integral-Separated PI Controller for Stochastic Optimization
Dan Wang, Mengqi Ji, Yong Wang, Haoqian Wang, Lu Fang

TL;DR
The paper introduces SPI-Optimizer, a novel integral-Separated PI controller-based optimizer that reduces oscillation and accelerates convergence in stochastic optimization without extra hyperparameters, outperforming existing methods.
Contribution
It proposes a hyperparameter-free, integral-Separated PI controller for optimization, addressing oscillation issues and improving convergence speed and accuracy.
Findings
Eliminates oscillation in stochastic optimization.
Reduces training epochs by up to 40%.
Achieves up to 27.5% error reduction on benchmarks.
Abstract
To overcome the oscillation problem in the classical momentum-based optimizer, recent work associates it with the proportional-integral (PI) controller, and artificially adds D term producing a PID controller. It suppresses oscillation with the sacrifice of introducing extra hyper-parameter. In this paper, we start by analyzing: why momentum-based method oscillates about the optimal point? and answering that: the fluctuation problem relates to the lag effect of integral (I) term. Inspired by the conditional integration idea in classical control society, we propose SPI-Optimizer, an integral-Separated PI controller based optimizer WITHOUT introducing extra hyperparameter. It separates momentum term adaptively when the inconsistency of current and historical gradient direction occurs. Extensive experiments demonstrate that SPIOptimizer generalizes well on popular network architectures to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Error Correcting Code Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
