On the Proximal Gradient Algorithm with Alternated Inertia

Franck Iutzeler (1); Jerome Malick (1) ((1) DAO)

arXiv:1801.05589 · math.OC · January 18, 2018 · J. Optim. Theory Appl.

TL;DR

This paper studies a variant of the proximal gradient algorithm that uses alternated inertia. It shows that the functional values decrease monotonically, gives convergence rates based on the local geometry of the objective function, and illustrates the results on common regularized problems.

Contribution

The paper analyzes the proximal gradient algorithm with alternated inertia and shows that, in contrast with usual accelerated proximal gradient methods, it yields monotonically decreasing functional values.

Findings

01. Alternated inertia yields monotonically decreasing functional values

02. Convergence rates are derived from local geometric properties of the objective function

03. The results are illustrated on common regularized problems

Abstract

In this paper, we investigate the attractive properties of the proximal gradient algorithm with inertia. Notably, we show that using alternated inertia yields monotonically decreasing functional values, which contrasts with usual accelerated proximal gradient methods. We also provide convergence rates for the algorithm with alternated inertia based on local geometric properties of the objective function. The results are put into perspective by discussions on several extensions and illustrations on common regularized problems.
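To make the idea in the abstract concrete, here is a minimal sketch of a proximal gradient loop in which the inertial (extrapolation) step is applied only on every other iteration. It is not taken from the paper: the lasso test problem, the fixed inertia value `alpha`, the step size `1/L`, the choice of which iterations receive inertia, and all function names are assumptions made for illustration. For this particular scheme with `alpha` in [0, 1], the objective is non-increasing along every second iterate, which is the property the sketch records.

```python
import numpy as np

def soft_threshold(v, t):
    # Proximal operator of t * ||.||_1 (componentwise shrinkage).
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def lasso_objective(A, b, lam, x):
    # F(x) = 0.5 * ||A x - b||^2 + lam * ||x||_1
    r = A @ x - b
    return 0.5 * (r @ r) + lam * np.abs(x).sum()

def prox_grad_alternated_inertia(A, b, lam, alpha=0.5, n_iter=200):
    # Assumed setup: step size 1/L, with L the Lipschitz constant of the
    # gradient of the smooth part (squared spectral norm of A).
    gamma = 1.0 / np.linalg.norm(A, 2) ** 2
    x_prev = np.zeros(A.shape[1])
    x = x_prev.copy()
    values = [lasso_objective(A, b, lam, x)]
    for k in range(n_iter):
        # Alternated inertia (assumed convention): extrapolate on odd k only,
        # take a plain proximal gradient step on even k.
        y = x + alpha * (x - x_prev) if k % 2 == 1 else x
        grad = A.T @ (A @ y - b)
        x_prev, x = x, soft_threshold(y - gamma * grad, gamma * lam)
        values.append(lasso_objective(A, b, lam, x))
    return x, np.array(values)
```

Calling `prox_grad_alternated_inertia(A, b, lam)` on a small random instance and inspecting `values[::2]` shows the decrease along every second iterate. A standard accelerated proximal gradient method offers no such guarantee on its functional values.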
