Convergence diagnostics for stochastic gradient descent with constant step size
Jerry Chee, Panos Toulis

TL;DR
This paper introduces a statistical diagnostic for detecting phase transitions in stochastic gradient descent with constant step size, enabling adaptive learning rate adjustments to improve convergence efficiency.
Contribution
It develops a diagnostic test for phase transition detection in SGD, derives a closed-form for the convergence region, and proposes an adaptive learning rate method based on stationarity detection.
Findings
Diagnostic accurately detects convergence region
Adaptive learning rate improves convergence speed
Method performs comparably to state-of-the-art algorithms
Abstract
Many iterative procedures in stochastic optimization exhibit a transient phase followed by a stationary phase. During the transient phase the procedure converges towards a region of interest, and during the stationary phase the procedure oscillates in that region, commonly around a single point. In this paper, we develop a statistical diagnostic test to detect such phase transition in the context of stochastic gradient descent with constant learning rate. We present theory and experiments suggesting that the region where the proposed diagnostic is activated coincides with the convergence region. For a class of loss functions, we derive a closed-form solution describing such region. Finally, we suggest an application to speed up convergence of stochastic gradient descent by halving the learning rate each time stationarity is detected. This leads to a new variant of stochastic gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Sparse and Compressive Sensing Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
