Exponential Convergence of (Stochastic) Gradient Descent for Separable Logistic Regression
Sacchit Kale, Piyushi Manupriya, Pierre Marion, Francis Bach, Anant Raj

TL;DR
This paper proves that increasing step sizes in gradient descent and stochastic gradient descent can achieve exponential convergence in separable logistic regression without instability, challenging the notion that acceleration requires edge-of-stability regimes.
Contribution
It introduces simple, non-adaptive step-size schedules that ensure exponential convergence within stable regimes, avoiding the need for complex adaptive methods or instability-based acceleration.
Findings
Gradient descent with increasing step sizes achieves exponential convergence.
Stochastic gradient descent also attains exponential rates with lightweight adaptive step sizes.
Stable, structured step-size growth suffices for acceleration in logistic regression.
Abstract
Gradient descent and stochastic gradient descent are central to modern machine learning, yet their behavior under large step sizes remains theoretically unclear. Recent work suggests that acceleration often arises near the edge of stability, where optimization trajectories become unstable and difficult to analyze. Existing results for separable logistic regression achieve faster convergence by explicitly leveraging such unstable regimes through constant or adaptive large step sizes. In this paper, we show that instability is not inherent to acceleration. We prove that gradient descent with a simple, non-adaptive increasing step-size schedule achieves exponential convergence for separable logistic regression under a margin condition, while remaining entirely within a stable optimization regime. The resulting method is anytime and does not require prior knowledge of the optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference · Privacy-Preserving Technologies in Data
