Neural Networks Generalize on Low Complexity Data

Sourav Chatterjee; Timothy Sudijono

arXiv:2409.12446·cs.LG·March 3, 2026

Neural Networks Generalize on Low Complexity Data

Sourav Chatterjee, Timothy Sudijono

PDF

Open Access

TL;DR

This paper demonstrates that neural networks with ReLU activation can generalize well on low complexity data, such as primality testing, by using minimum description length principles to find simple interpolating models.

Contribution

It introduces a framework linking data complexity, description length, and neural network generalization, with theoretical results on primality testing.

Findings

01

MDL neural networks accurately classify primes with high probability

02

Networks generalize on low complexity data without explicit training for specific tasks

03

Extensions suggest robustness to noisy data and tempered overfitting

Abstract

We show that feedforward neural networks with ReLU activation generalize on low complexity data, suitably defined. Given i.i.d.~data generated from a simple programming language, the minimum description length (MDL) feedforward neural network which interpolates the data generalizes with high probability. We define this simple programming language, along with a notion of description length of such networks. We provide several examples on basic computational tasks, such as checking primality of a natural number. For primality testing, our theorem shows the following and more. Suppose that we draw an i.i.d.~sample of $n$ numbers uniformly at random from $1$ to $N$ . For each number $x_{i}$ , let $y_{i} = 1$ if $x_{i}$ is a prime and $0$ if it is not. Then, the interpolating MDL network accurately answers, with probability $1 - O ((ln N) / n)$ , whether a newly drawn number between $1$ and $N$ is a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Minimum Description Length