Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

Emi Zeger; Mert Pilanci

arXiv:2603.23831·cs.LG·March 26, 2026

Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

Emi Zeger, Mert Pilanci

PDF

Open Access

TL;DR

This paper explores hidden convex structures in deep neural networks, especially ReLU-based models, by linking them to sparse signal processing, which could improve understanding and training of neural networks.

Contribution

It reveals hidden convexities in ReLU neural networks and connects deep learning with sparse signal processing to enhance theoretical understanding.

Findings

01

Identification of convex equivalences in ReLU networks

02

Connection between neural networks and sparse signal models

03

Potential for improved training and analysis methods

Abstract

Deep neural networks (DNNs), particularly those using Rectified Linear Unit (ReLU) activation functions, have achieved remarkable success across diverse machine learning tasks, including image recognition, audio processing, and language modeling. Despite this success, the non-convex nature of DNN loss functions complicates optimization and limits theoretical understanding. In this paper, we highlight how recently developed convex equivalences of ReLU NNs and their connections to sparse signal processing models can address the challenges of training and understanding NNs. Recent research has uncovered several hidden convexities in the loss landscapes of certain NN architectures, notably two-layer ReLU networks and other deeper or varied architectures. This paper seeks to provide an accessible and educational overview that bridges recent advances in the mathematics of deep learning with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Advanced Neural Network Applications · Sparse and Compressive Sensing Techniques