A telescoping Bregmanian proximal gradient method without the global   Lipschitz continuity assumption

Daniel Reem; Simeon Reich; Alvaro De Pierro

arXiv:1804.10273·math.OC·November 12, 2019

A telescoping Bregmanian proximal gradient method without the global Lipschitz continuity assumption

Daniel Reem, Simeon Reich, Alvaro De Pierro

PDF

TL;DR

This paper introduces a novel proximal gradient method that operates without requiring the gradient of the smooth part to be globally Lipschitz continuous, broadening its applicability in various spaces.

Contribution

It proposes a telescoping Bregmanian proximal gradient method that relaxes the global Lipschitz continuity assumption, using a telescopic decomposition and Bregman divergence.

Findings

01

Establishes a non-asymptotic convergence rate in function values.

02

Proves weak convergence of the entire sequence to a minimizer.

03

Applicable to finite and infinite-dimensional spaces.

Abstract

The problem of minimization of the sum of two convex functions has various theoretical and real-world applications. One of the popular methods for solving this problem is the proximal gradient method (proximal forward-backward algorithm). A very common assumption in the use of this method is that the gradient of the smooth term is globally Lipschitz continuous. However, this assumption is not always satisfied in practice, thus casting a limitation on the method. In this paper, we discuss, in a wide class of finite and infinite-dimensional spaces, a new variant of the proximal gradient method which does not impose the above-mentioned global Lipschitz continuity assumption. A key contribution of the method is the dependence of the iterative steps on a certain telescopic decomposition of the constraint set into subsets. Moreover, we use a Bregman divergence in the proximal forward-backward…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.