Intrinsic dimensionality and generalization properties of the   $\mathcal{R}$-norm inductive bias

Navid Ardeshir; Daniel Hsu; Clayton Sanford

arXiv:2206.05317·cs.LG·June 27, 2023

Intrinsic dimensionality and generalization properties of the $\mathcal{R}$-norm inductive bias

Navid Ardeshir, Daniel Hsu, Clayton Sanford

PDF

Open Access 1 Repo

TL;DR

This paper investigates the properties of $\\mathcal{R}$-norm minimizing interpolants in neural networks, revealing their multivariate nature and limitations in achieving optimal generalization, thus providing insights into neural network inductive biases.

Contribution

It characterizes the structural and statistical properties of $\\mathcal{R}$-norm interpolants, highlighting their multivariate complexity and limitations for optimal generalization.

Findings

01

Interpolants are intrinsically multivariate functions.

02

$\\mathcal{R}$-norm bias does not always lead to optimal generalization.

03

Results connect the inductive bias to practical neural network training.

Abstract

We study the structural and statistical properties of $R$ -norm minimizing interpolants of datasets labeled by specific target functions. The $R$ -norm is the basis of an inductive bias for two-layer neural networks, recently introduced to capture the functional effect of controlling the size of network weights, independently of the network width. We find that these interpolants are intrinsically multivariate functions, even when there are ridge functions that fit the data, and also that the $R$ -norm inductive bias is not sufficient for achieving statistically optimal generalization for certain learning problems. Altogether, these results shed new light on an inductive bias that is connected to practical neural network training.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chsanford/cnn-weight-tying
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and ELM · Model Reduction and Neural Networks