Exploring the loss landscape of regularized neural networks via convex   duality

Sungyoon Kim; Aaron Mishkin; Mert Pilanci

arXiv:2411.07729·cs.LG·April 30, 2025

Exploring the loss landscape of regularized neural networks via convex duality

Sungyoon Kim, Aaron Mishkin, Mert Pilanci

PDF

Open Access

TL;DR

This paper analyzes the loss landscape of regularized neural networks by transforming the problem into a convex dual, revealing the structure of solutions, phase transitions, and connectivity properties across different architectures.

Contribution

It introduces a convex duality framework to characterize stationary points, solution sets, and landscape topology of regularized neural networks, including phase transitions and solution nonuniqueness.

Findings

01

Characterization of stationary points via convex duality.

02

Identification of phase transitions in global optima topology.

03

Extension of results to various neural network architectures.

Abstract

We discuss several aspects of the loss landscape of regularized neural networks: the structure of stationary points, connectivity of optimal solutions, path with nonincreasing loss to arbitrary global optimum, and the nonuniqueness of optimal solutions, by casting the problem into an equivalent convex problem and considering its dual. Starting from two-layer neural networks with scalar output, we first characterize the solution set of the convex problem using its dual and further characterize all stationary points. With the characterization, we show that the topology of the global optima goes through a phase transition as the width of the network changes, and construct counterexamples where the problem may have a continuum of optimal solutions. Finally, we show that the solution set characterization and connectivity results can be extended to different architectures, including two-layer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and ELM · Sparse and Compressive Sensing Techniques

MethodsSparse Evolutionary Training