Optimization with Sparsity-Inducing Penalties

Francis Bach (LIENS; INRIA Paris - Rocquencourt); Rodolphe Jenatton; (LIENS; INRIA Paris - Rocquencourt); Julien Mairal; Guillaume Obozinski; (LIENS; INRIA Paris - Rocquencourt)

arXiv:1108.0775·cs.LG·November 24, 2011

Optimization with Sparsity-Inducing Penalties

Francis Bach (LIENS, INRIA Paris - Rocquencourt), Rodolphe Jenatton, (LIENS, INRIA Paris - Rocquencourt), Julien Mairal, Guillaume Obozinski, (LIENS, INRIA Paris - Rocquencourt)

PDF

TL;DR

This paper reviews optimization techniques for sparsity-inducing penalties in data and model estimation, covering convex and non-convex methods, with extensive experimental comparisons.

Contribution

It provides a comprehensive overview of optimization tools and algorithms tailored for sparsity-inducing penalties, including recent extensions and practical comparisons.

Findings

01

Proximal and block-coordinate methods effectively handle non-smooth penalties.

02

Reweighted and homotopy techniques improve convergence in sparse optimization.

03

Extensive experiments compare computational efficiency of various algorithms.

Abstract

Sparse estimation methods are aimed at using or obtaining parsimonious representations of data or models. They were first dedicated to linear variable selection but numerous extensions have now emerged such as structured sparsity or kernel selection. It turns out that many of the related estimation problems can be cast as convex optimization problems by regularizing the empirical risk with appropriate non-smooth norms. The goal of this paper is to present from a general perspective optimization tools and techniques dedicated to such sparsity-inducing penalties. We cover proximal methods, block-coordinate descent, reweighted $ℓ_{2}$ -penalized techniques, working-set and homotopy methods, as well as non-convex formulations and extensions, and provide an extensive set of experiments to compare various algorithms from a computational point of view.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.