The Rules-and-Facts Model for Simultaneous Generalization and Memorization in Neural Networks
Gabriele Farn\'e, Fabrizio Boncoraglio, Lenka Zdeborov\'a

TL;DR
This paper introduces the Rules-and-Facts (RAF) model, a theoretical framework that explains how neural networks can simultaneously learn underlying rules and memorize specific facts, highlighting the roles of overparameterization and regularization.
Contribution
The paper presents the RAF model, bridging rule generalization and memorization analysis, providing a minimal setting for understanding dual learning capabilities in neural networks.
Findings
Overparameterization supports simultaneous rule learning and memorization.
Regularization and kernel choice influence capacity allocation.
The model offers a theoretical basis for neural networks' dual learning ability.
Abstract
A key capability of modern neural networks is their capacity to simultaneously learn underlying rules and memorize specific facts or exceptions. Yet, theoretical understanding of this dual capability remains limited. We introduce the Rules-and-Facts (RAF) model, a minimal solvable setting that enables precise characterization of this phenomenon by bridging two classical lines of work in the statistical physics of learning: the teacher-student framework for generalization and Gardner-style capacity analysis for memorization. In the RAF model, a fraction of training labels is generated by a structured teacher rule, while a fraction consists of unstructured facts with random labels. We characterize when the learner can simultaneously recover the underlying rule - allowing generalization to new data - and memorize the unstructured examples. Our results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Stochastic Gradient Optimization Techniques · Statistical Mechanics and Entropy
