Representation and Regression Problems in Neural Networks: Relaxation, Generalization, and Numerics

Kang Liu; Enrique Zuazua

arXiv:2412.01619 · cs.LG · April 4, 2025

TL;DR

This paper develops a convexified framework for training shallow neural networks on representation and regression problems. It provides theoretical guarantees, generalization bounds, and efficient algorithms for both low- and high-dimensional data.

Contribution

It introduces a mean-field convexification approach, proves the absence of relaxation gaps, and proposes scalable algorithms for high-dimensional data.
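As a rough illustration of what a mean-field convexification can look like, the sketch below writes a shallow network as an integral against a measure. The notation ($\omega_j$, $a_j$, $b_j$, $\sigma$, $\Omega$, $\mu$) is assumed here for illustration and is not taken from this page; the paper's exact formulation may differ.

```latex
% Shallow NN with P neurons (illustrative notation, not from the source page)
f(x) = \sum_{j=1}^{P} \omega_j \, \sigma\big(\langle a_j, x \rangle + b_j\big)

% Mean-field form: the finite sum is replaced by an integral against
% a signed measure \mu on an assumed parameter set \Omega
f_\mu(x) = \int_{\Omega} \sigma\big(\langle a, x \rangle + b\big) \, d\mu(a, b)

% A finite network is recovered by an atomic measure
\mu = \sum_{j=1}^{P} \omega_j \, \delta_{(a_j, b_j)}
```

Under these assumptions, $f_\mu$ is linear in $\mu$, so a fitting condition such as $f_\mu(x_i) = y_i$ becomes a linear constraint even though it is non-convex in $(\omega_j, a_j, b_j)$. "Absence of relaxation gaps" then means that the optimal value of the relaxed problem over measures coincides with that of the original problem over finite networks.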

Findings

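01. Convexification via a mean-field approach eliminates relaxation gaps.

02. Generalization bounds are established for the resulting NN solutions, and their dependence on key hyperparameters is analyzed to propose optimal choices.

03. Efficient algorithms are proposed for both regimes: the simplex method for low-dimensional datasets, and a sparsification algorithm combined with gradient descent for high-dimensional ones.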

Abstract

In this work, we address three non-convex optimization problems associated with the training of shallow neural networks (NNs) for exact and approximate representation, as well as for regression tasks. Through a mean-field approach, we convexify these problems and, applying a representer theorem, prove the absence of relaxation gaps. We establish generalization bounds for the resulting NN solutions, assessing their predictive performance on test datasets and, analyzing the impact of key hyperparameters on these bounds, propose optimal choices. On the computational side, we examine the discretization of the convexified problems and derive convergence rates. For low-dimensional datasets, these discretized problems are efficiently solvable using the simplex method. For high-dimensional datasets, we propose a sparsification algorithm that, combined with gradient descent for…
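The low-dimensional case mentioned in the abstract can be sketched as a small linear program. The example below is a minimal illustration under stated assumptions, not the authors' implementation: it assumes 1-D inputs, a ReLU activation, a fixed grid of candidate neurons `(a_j, b_j)`, and an ℓ1 objective on the outer weights as the discretized analogue of a norm on the measure. The function name `fit_exact_representation` and all parameters are hypothetical, and SciPy's HiGHS dual simplex (`method="highs-ds"`) stands in for "the simplex method".

```python
import numpy as np
from scipy.optimize import linprog


def relu(z):
    return np.maximum(z, 0.0)


def fit_exact_representation(x, y, grid_a, grid_b):
    """Solve min ||w||_1  s.t.  sum_j w_j * relu(a_j * x_i + b_j) = y_i.

    Hypothetical helper: the grid, the activation, and the l1 objective
    are assumptions made for this illustration.
    """
    # Enumerate every candidate neuron (a_j, b_j) on the grid.
    A, B = np.meshgrid(grid_a, grid_b, indexing="ij")
    a, b = A.ravel(), B.ravel()
    # Feature matrix: Phi[i, j] = relu(a_j * x_i + b_j).
    Phi = relu(np.outer(x, a) + b)
    m = a.size
    # Split w = p - q with p, q >= 0; minimizing sum(p) + sum(q)
    # then equals minimizing ||w||_1, which makes the problem an LP.
    cost = np.ones(2 * m)
    A_eq = np.hstack([Phi, -Phi])
    res = linprog(cost, A_eq=A_eq, b_eq=y, bounds=(0, None), method="highs-ds")
    if not res.success:
        raise RuntimeError(res.message)
    w = res.x[:m] - res.x[m:]
    return w, a, b, Phi


# Toy dataset: N = 5 samples of a smooth target (illustrative only).
x = np.linspace(-0.8, 0.8, 5)
y = np.sin(np.pi * x)
w, a, b, Phi = fit_exact_representation(
    x, y, grid_a=[-1.0, 1.0], grid_b=np.linspace(-1.0, 1.0, 21)
)

# Neurons with nonzero outer weight form the resulting shallow network.
active = np.flatnonzero(np.abs(w) > 1e-9)
print("active neurons:", active.size, "of", w.size)
print("max residual:", np.max(np.abs(Phi @ w - y)))
```

Because a simplex method returns a vertex of the feasible set, the solution has at most as many nonzero weights as there are data points, so the discretized convex problem yields a small network directly. The grid size grows quickly with input dimension, which is consistent with the abstract reserving this route for low-dimensional datasets.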

Taxonomy

Topics: Neural Networks and Applications