UNREALIZABLE LEARNING IN FEEDFORWARD NEURAL NETWORKS
Matts Sporre

TL;DR
This paper uses statistical mechanics to analyze unrealizable generalization in large feed-forward neural networks, revealing phase transitions and the effects of noise on learning performance.
Contribution
It provides a detailed theoretical analysis of unrealizable learning in perceptrons and tree committee machines, including phase transitions and the impact of noise, using replica symmetry and symmetry breaking.
Findings
Identifies phase transition for low noise in perceptrons.
Shows transition disappears with increased noise.
Finds decay of generalization error as (ln α / α)^k for large α.
Abstract
Statistical mechanics is used to study unrealizable generalization in two large feed-forward neural networks with binary weights and output, a perceptron and a tree committee machine. The student is trained by a teacher being larger, i.e. having more units than the student. It is shown that this is the same as using training data corrupted by Gaussian noise. Each machine is considered in the high temperature limit and in the replica symmetric approximation as well as for one step of replica symmetry breaking. For the perceptron a phase transition is found for low noise. However the transition is not to optimal learning. If the noise is increased the transition disappears. In both cases will approach optimal performance with a decay for large . For the tree committee machine noise in the input layer is studied, as well as noise in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Statistical Mechanics and Entropy · Face and Expression Recognition
