Gradient Descent Converges Linearly for Logistic Regression on Separable   Data

Kyriakos Axiotis; Maxim Sviridenko

arXiv:2306.14381·cs.LG·June 27, 2023

Gradient Descent Converges Linearly for Logistic Regression on Separable Data

Kyriakos Axiotis, Maxim Sviridenko

PDF

Open Access 1 Video

TL;DR

This paper proves that gradient descent with variable learning rates achieves linear convergence for logistic regression on separable data, improving understanding of convergence behavior without strong convexity.

Contribution

It demonstrates that variable learning rates enable linear convergence of gradient descent for logistic regression on separable data, challenging previous assumptions.

Findings

01

Loss converges exponentially with iterations.

02

Variable learning rates are crucial for linear convergence.

03

Sparse logistic regression benefits from improved sparsity-error tradeoff.

Abstract

We show that running gradient descent with variable learning rate guarantees loss $f (x) \leq 1.1 \cdot f (x^{*}) + ϵ$ for the logistic regression objective, where the error $ϵ$ decays exponentially with the number of iterations and polynomially with the magnitude of the entries of an arbitrary fixed solution $x^{*}$ . This is in contrast to the common intuition that the absence of strong convexity precludes linear convergence of first-order methods, and highlights the importance of variable learning rates for gradient descent. We also apply our ideas to sparse logistic regression, where they lead to an exponential improvement of the sparsity-error tradeoff.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Gradient Descent Converges Linearly for Logistic Regression on Separable Data· slideslive

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Statistical Methods and Inference

MethodsLogistic Regression