Smooth over-parameterized solvers for non-smooth structured optimization
Clarice Poon, Gabriel Peyr\'e

TL;DR
This paper introduces a smooth over-parameterized approach for non-smooth structured optimization, improving convergence and efficiency in imaging and machine learning tasks by connecting gradient descent to mirror descent flows.
Contribution
It proposes a non-convex smooth reformulation of non-smooth problems, with theoretical convergence guarantees and an improved VarPro-based algorithm for faster optimization.
Findings
Dimension-free convergence bounds for the proposed method
Enhanced conditioning and convergence with Variable Projection (VarPro)
Effective application to inverse problems and supervised learning
Abstract
Non-smooth optimization is a core ingredient of many imaging or machine learning pipelines. Non-smoothness encodes structural constraints on the solutions, such as sparsity, group sparsity, low-rank and sharp edges. It is also the basis for the definition of robust loss functions and scale-free functionals such as square-root Lasso. Standard approaches to deal with non-smoothness leverage either proximal splitting or coordinate descent. These approaches are effective but usually require parameter tuning, preconditioning or some sort of support pruning. In this work, we advocate and study a different route, which operates a non-convex but smooth over-parametrization of the underlying non-smooth optimization problems. This generalizes quadratic variational forms that are at the heart of the popular Iterative Reweighted Least Squares (IRLS). Our main theoretical contribution connects…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Photoacoustic and Ultrasonic Imaging · Numerical methods in inverse problems
