"Active-set complexity" of proximal gradient: How long does it take to find the sparsity pattern?
Julie Nutini, Mark Schmidt, Warren Hare

TL;DR
This paper introduces the concept of active-set complexity for proximal gradient methods, providing bounds on the number of iterations needed to identify the optimal sparsity pattern in certain convex optimization problems.
Contribution
It defines active-set complexity and establishes bounds for proximal gradient methods in problems with strongly-convex smooth and separable non-smooth functions.
Findings
Active-set complexity bounds for proximal gradient methods.
Finite identification of sparsity pattern under nondegeneracy conditions.
Application to problems with L1-regularization and non-negative constraints.
Abstract
Proximal gradient methods have been found to be highly effective for solving minimization problems with non-negative constraints or L1-regularization. Under suitable nondegeneracy conditions, it is known that these algorithms identify the optimal sparsity pattern for these types of problems in a finite number of iterations. However, it is not known how many iterations this may take. We introduce the notion of the "active-set complexity", which in these cases is the number of iterations before an algorithm is guaranteed to have identified the final sparsity pattern. We further give a bound on the active-set complexity of proximal gradient methods in the common case of minimizing the sum of a strongly-convex smooth function and a separable convex non-smooth function.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
