Exponential Convergence of (Stochastic) Gradient Descent for Separable Logistic Regression

Sacchit Kale; Piyushi Manupriya; Pierre Marion; Francis Bach; Anant Raj

arXiv:2602.18946·cs.LG·March 2, 2026

Exponential Convergence of (Stochastic) Gradient Descent for Separable Logistic Regression

Sacchit Kale, Piyushi Manupriya, Pierre Marion, Francis Bach, Anant Raj

PDF

Open Access

TL;DR

This paper proves that increasing step sizes in gradient descent and stochastic gradient descent can achieve exponential convergence in separable logistic regression without instability, challenging the notion that acceleration requires edge-of-stability regimes.

Contribution

It introduces simple, non-adaptive step-size schedules that ensure exponential convergence within stable regimes, avoiding the need for complex adaptive methods or instability-based acceleration.

Findings

01

Gradient descent with increasing step sizes achieves exponential convergence.

02

Stochastic gradient descent also attains exponential rates with lightweight adaptive step sizes.

03

Stable, structured step-size growth suffices for acceleration in logistic regression.

Abstract

Gradient descent and stochastic gradient descent are central to modern machine learning, yet their behavior under large step sizes remains theoretically unclear. Recent work suggests that acceleration often arises near the edge of stability, where optimization trajectories become unstable and difficult to analyze. Existing results for separable logistic regression achieve faster convergence by explicitly leveraging such unstable regimes through constant or adaptive large step sizes. In this paper, we show that instability is not inherent to acceleration. We prove that gradient descent with a simple, non-adaptive increasing step-size schedule achieves exponential convergence for separable logistic regression under a margin condition, while remaining entirely within a stable optimization regime. The resulting method is anytime and does not require prior knowledge of the optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference · Privacy-Preserving Technologies in Data