What Kinds of Functions do Deep Neural Networks Learn? Insights from   Variational Spline Theory

Rahul Parhi; Robert D. Nowak

arXiv:2105.03361·stat.ML·April 18, 2022

What Kinds of Functions do Deep Neural Networks Learn? Insights from Variational Spline Theory

Rahul Parhi, Robert D. Nowak

PDF

TL;DR

This paper introduces a variational framework and a new function space to analyze the properties of functions learned by deep neural networks with ReLU activations, linking neural networks to spline theory and sparsity.

Contribution

It develops a novel variational approach and function space that characterize deep neural network solutions, connecting architectural features to function regularization and spline theory.

Findings

01

Deep ReLU networks are solutions to regularized data fitting in the proposed function space.

02

The function space captures compositional structure and sparsity, explaining architectural choices.

03

The framework links neural networks to classical spline theory, providing new theoretical insights.

Abstract

We develop a variational framework to understand the properties of functions learned by fitting deep neural networks with rectified linear unit activations to data. We propose a new function space, which is reminiscent of classical bounded variation-type spaces, that captures the compositional structure associated with deep neural networks. We derive a representer theorem showing that deep ReLU networks are solutions to regularized data fitting problems over functions from this space. The function space consists of compositions of functions from the Banach spaces of second-order bounded variation in the Radon domain. These are Banach spaces with sparsity-promoting norms, giving insight into the role of sparsity in deep neural networks. The neural network solutions have skip connections and rank bounded weight matrices, providing new theoretical support for these common architectural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsWeight Decay