Bounds on learning in polynomial time

Heinz Horner; Anthea Bethge

arXiv:cond-mat/9705259·cond-mat.dis-nn·February 8, 2017

Bounds on learning in polynomial time

Heinz Horner, Anthea Bethge

PDF

TL;DR

This paper investigates the limits of polynomial-time learning in neural networks, analyzing capacity bounds and revealing discrepancies between theoretical predictions and empirical results, especially for the committee machine.

Contribution

It introduces new bounds on learning capacity for various neural architectures and clarifies the relationship between storage capacity and polynomial learning time.

Findings

01

Polynomial learning algorithms can be practical for certain network sizes.

02

Discrepancies exist between theoretical capacity predictions and empirical results for the committee machine.

03

New simulations reveal subtleties in defining learning capacity and time dependence.

Abstract

The performance of large neural networks can be judged not only by their storage capacity but also by the time required for learning. A polynomial learning algorithm with learning time $\sim N^{2}$ in a network with $N$ units might be practical whereas a learning time $\sim e^{N}$ would allow rather small networks only. The question of absolute storage capacity $α_{c}$ and capacity for polynomial learning rules $α_{p}$ is discussed for several feed-forward architectures, the perceptron, the binary perceptron, the committee machine and a perceptron with fixed weights in the first layer and adaptive weights in the second layer. The analysis is based partially on dynamic mean field theory which is valid for $N \to \infty$ . Especially for the committee machine a value $α_{p}$ considerably lower than the capacity predicted by replica theory or simulations is found. This discrepancy is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.