The proximal-proximal gradient algorithm
Ting Kei Pong

TL;DR
This paper introduces the proximal-proximal gradient algorithm for convex optimization problems involving a smooth and a composite nonsmooth term, providing convergence guarantees and demonstrating practical efficiency on large-scale problems.
Contribution
The paper proposes a new proximal-proximal gradient algorithm that simplifies subproblems for composite nonsmooth convex optimization, extending the applicability of proximal methods.
Findings
Algorithm converges to an optimal solution.
Provides an upper bound on iteration complexity.
Performs well on large-scale stochastic and logistic fused lasso problems.
Abstract
We consider the problem of minimizing a convex objective which is the sum of a smooth part, with Lipschitz continuous gradient, and a nonsmooth part. Inspired by various applications, we focus on the case when the nonsmooth part is a composition of a proper closed convex function P and a nonzero affine map, with the proximal mappings of \tau P, \tau > 0, easy to compute. In this case, a direct application of the widely used proximal gradient algorithm does not necessarily lead to easy subproblems. In view of this, we propose a new algorithm, the proximal-proximal gradient algorithm, which admits easy subproblems. Our algorithm reduces to the proximal gradient algorithm if the affine map is just the identity map and the stepsizes are suitably chosen, and it is equivalent to applying a variant of the alternating minimization algorithm [35] to the dual problem. Moreover, it is closely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Optimization and Variational Analysis
