Thinking Outside the Ball: Optimal Learning with Gradient Descent for   Generalized Linear Stochastic Convex Optimization

Idan Amir; Roi Livni; Nathan Srebro

arXiv:2202.13328·cs.LG·November 1, 2022

Thinking Outside the Ball: Optimal Learning with Gradient Descent for Generalized Linear Stochastic Convex Optimization

Idan Amir, Roi Livni, Nathan Srebro

PDF

Open Access 1 Video

TL;DR

This paper demonstrates that early stopped gradient descent can achieve optimal sample complexity for generalized linear stochastic convex optimization without regularization, contrasting with traditional methods requiring more iterations.

Contribution

It introduces a novel analysis leveraging distribution-dependent uniform convergence to attain optimal learning rates with fewer iterations in generalized linear convex optimization.

Findings

01

Early stopping with gradient descent achieves optimal sample complexity.

02

Uniform convergence in distribution-dependent balls is key to improved efficiency.

03

Contrast with standard stochastic convex optimization requiring more iterations.

Abstract

We consider linear prediction with a convex Lipschitz loss, or more generally, stochastic convex optimization problems of generalized linear form, i.e.~where each instantaneous loss is a scalar convex function of a linear function. We show that in this setting, early stopped Gradient Descent (GD), without any explicit regularization or projection, ensures excess error at most $ϵ$ (compared to the best possible with unit Euclidean norm) with an optimal, up to logarithmic factors, sample complexity of $\tilde{O} (1/ ϵ^{2})$ and only $\tilde{O} (1/ ϵ^{2})$ iterations. This contrasts with general stochastic convex optimization, where $Ω (1/ ϵ^{4})$ iterations are needed Amir et al. [2021b]. The lower iteration complexity is ensured by leveraging uniform convergence rather than stability. But instead of uniform convergence in a norm ball, which we show can guarantee…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Thinking Outside the Ball: Optimal Learning with Gradient Descent for Generalized Linear Stochastic Convex Optimization· slideslive

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference