Learning and generalization theories of large committee--machines
Remi Monasson, Riccardo Zecchina

TL;DR
This paper derives the critical learning capacity and stability conditions for large committee machines, revealing a Bayesian generalization crossover at a specific capacity related to the number of hidden units.
Contribution
It introduces theoretical formulas for the learning capacity and stability of large committee machines, advancing understanding of their generalization behavior.
Findings
Critical learning capacity $rac{16}{ ext{ extpi}}\, ext{sqrt}( ext{ln} K)$ derived
Stability of solutions verified in the large $K$ limit
Bayesian generalization crossover identified at $ ext{alpha}=K$
Abstract
The study of the distribution of volumes associated to the internal representations of learning examples allows us to derive the critical learning capacity () of large committee machines, to verify the stability of the solution in the limit of a large number of hidden units and to find a Bayesian generalization cross--over at .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTheoretical and Computational Physics · Quantum many-body systems · Statistical Mechanics and Entropy
