Smoothing the Edges: Smooth Optimization for Sparse Regularization using Hadamard Overparametrization

Chris Kolb; Christian L. M\"uller; Bernd Bischl; David R\"ugamer

arXiv:2307.03571·cs.LG·April 9, 2026

Smoothing the Edges: Smooth Optimization for Sparse Regularization using Hadamard Overparametrization

Chris Kolb, Christian L. M\"uller, Bernd Bischl, David R\"ugamer

PDF

TL;DR

This paper introduces a smooth optimization framework for sparse regularization that leverages overparameterization and penalty changes, enabling fully differentiable solutions compatible with gradient descent.

Contribution

The authors develop a novel overparameterization and penalty transformation method that ensures equivalence of solutions, facilitating smooth optimization for sparse regularization problems.

Findings

01

The surrogate objective has identical global and local minima as the original.

02

The method is effective in high-dimensional regression and neural network sparsity.

03

Theoretical results guarantee the absence of spurious solutions.

Abstract

We present a framework for smooth optimization of explicitly regularized objectives for (structured) sparsity. These non-smooth and possibly non-convex problems typically rely on solvers tailored to specific models and regularizers. In contrast, our method enables fully differentiable and approximation-free optimization and is thus compatible with the ubiquitous gradient descent paradigm in deep learning. The proposed optimization transfer comprises an overparameterization of selected parameters and a change of penalties. In the overparametrized problem, smooth surrogate regularization induces non-smooth, sparse regularization in the base parametrization. We prove that the surrogate objective is equivalent in the sense that it not only has identical global minima but also matching local minima, thereby avoiding the introduction of spurious solutions. Additionally, our theory establishes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.