Neural Networks Generalize on Low Complexity Data
Sourav Chatterjee, Timothy Sudijono

TL;DR
This paper demonstrates that neural networks with ReLU activation can generalize well on low complexity data, such as primality testing, by using minimum description length principles to find simple interpolating models.
Contribution
It introduces a framework linking data complexity, description length, and neural network generalization, with theoretical results on primality testing.
Findings
MDL neural networks accurately classify primes with high probability
Networks generalize on low complexity data without explicit training for specific tasks
Extensions suggest robustness to noisy data and tempered overfitting
Abstract
We show that feedforward neural networks with ReLU activation generalize on low complexity data, suitably defined. Given i.i.d.~data generated from a simple programming language, the minimum description length (MDL) feedforward neural network which interpolates the data generalizes with high probability. We define this simple programming language, along with a notion of description length of such networks. We provide several examples on basic computational tasks, such as checking primality of a natural number. For primality testing, our theorem shows the following and more. Suppose that we draw an i.i.d.~sample of numbers uniformly at random from to . For each number , let if is a prime and if it is not. Then, the interpolating MDL network accurately answers, with probability , whether a newly drawn number between and is a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Minimum Description Length
