Sequential convergence of AdaGrad algorithm for smooth convex   optimization

Cheik Traor\'e; Edouard Pauwels

arXiv:2011.12341·math.OC·April 14, 2021

Sequential convergence of AdaGrad algorithm for smooth convex optimization

Cheik Traor\'e, Edouard Pauwels

PDF

TL;DR

This paper proves the convergence of AdaGrad algorithms, both scalar and coordinatewise variants, for smooth convex functions with Lipschitz gradients by establishing a variable metric quasi-Fejér monotonicity property.

Contribution

It introduces a novel convergence proof for AdaGrad algorithms using a variable metric quasi-Fejér monotonicity approach, applicable to smooth convex optimization.

Findings

01

AdaGrad sequences are convergent for convex functions with Lipschitz gradients.

02

The proof relies on the quasi-Fejér monotonicity property.

03

Both scalar and coordinatewise AdaGrad variants are covered.

Abstract

We prove that the iterates produced by, either the scalar step size variant, or the coordinatewise variant of AdaGrad algorithm, are convergent sequences when applied to convex objective functions with Lipschitz gradient. The key insight is to remark that such AdaGrad sequences satisfy a variable metric quasi-Fej\'er monotonicity property, which allows to prove convergence.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAdaGrad