UNREALIZABLE LEARNING IN FEEDFORWARD NEURAL NETWORKS

Matts Sporre

arXiv:cond-mat/9506029·cond-mat·May 23, 2007

UNREALIZABLE LEARNING IN FEEDFORWARD NEURAL NETWORKS

Matts Sporre

PDF

Open Access

TL;DR

This paper uses statistical mechanics to analyze unrealizable generalization in large feed-forward neural networks, revealing phase transitions and the effects of noise on learning performance.

Contribution

It provides a detailed theoretical analysis of unrealizable learning in perceptrons and tree committee machines, including phase transitions and the impact of noise, using replica symmetry and symmetry breaking.

Findings

01

Identifies phase transition for low noise in perceptrons.

02

Shows transition disappears with increased noise.

03

Finds decay of generalization error as (ln α / α)^k for large α.

Abstract

Statistical mechanics is used to study unrealizable generalization in two large feed-forward neural networks with binary weights and output, a perceptron and a tree committee machine. The student is trained by a teacher being larger, i.e. having more units than the student. It is shown that this is the same as using training data corrupted by Gaussian noise. Each machine is considered in the high temperature limit and in the replica symmetric approximation as well as for one step of replica symmetry breaking. For the perceptron a phase transition is found for low noise. However the transition is not to optimal learning. If the noise is increased the transition disappears. In both cases $ϵ_{g}$ will approach optimal performance with a $(ln α / α)^{k}$ decay for large $α$ . For the tree committee machine noise in the input layer is studied, as well as noise in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Statistical Mechanics and Entropy · Face and Expression Recognition