Learning and generalization theories of large committee--machines

Remi Monasson; Riccardo Zecchina

arXiv:cond-mat/9601122·cond-mat·May 23, 2007

Learning and generalization theories of large committee--machines

Remi Monasson, Riccardo Zecchina

PDF

Open Access

TL;DR

This paper derives the critical learning capacity and stability conditions for large committee machines, revealing a Bayesian generalization crossover at a specific capacity related to the number of hidden units.

Contribution

It introduces theoretical formulas for the learning capacity and stability of large committee machines, advancing understanding of their generalization behavior.

Findings

01

Critical learning capacity $rac{16}{ ext{ extpi}}\, ext{sqrt}( ext{ln} K)$ derived

02

Stability of solutions verified in the large $K$ limit

03

Bayesian generalization crossover identified at $ ext{alpha}=K$

Abstract

The study of the distribution of volumes associated to the internal representations of learning examples allows us to derive the critical learning capacity ( $α_{c} = \frac{16}{π} ln K$ ) of large committee machines, to verify the stability of the solution in the limit of a large number $K$ of hidden units and to find a Bayesian generalization cross--over at $α = K$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTheoretical and Computational Physics · Quantum many-body systems · Statistical Mechanics and Entropy