A theory of learning with constrained weight-distribution
Weishun Zhong, Ben Sorscher, Daniel D Lee, Haim Sompolinsky

TL;DR
This paper develops a statistical mechanical theory of learning in neural networks with structural constraints on weights, predicting capacity reductions and proposing algorithms that incorporate prior weight distributions to improve learning.
Contribution
It introduces a novel theoretical framework linking weight distribution constraints to neural network capacity and develops algorithms based on optimal transport to implement these constraints.
Findings
Capacity reduction relates to Wasserstein distance between distributions.
Distribution-constrained learning outperforms unconstrained methods.
Optimal transport-based algorithms effectively incorporate prior weight knowledge.
Abstract
A central question in computational neuroscience is how structure determines function in neural networks. The emerging high-quality large-scale connectomic datasets raise the question of what general functional principles can be gleaned from structural information such as the distribution of excitatory/inhibitory synapse types and the distribution of synaptic weights. Motivated by this question, we developed a statistical mechanical theory of learning in neural networks that incorporates structural information as constraints. We derived an analytical solution for the memory capacity of the perceptron, a basic feedforward model of supervised learning, with constraint on the distribution of its weights. Our theory predicts that the reduction in capacity due to the constrained weight-distribution is related to the Wasserstein distance between the imposed distribution and that of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Topological and Geometric Data Analysis · Neural dynamics and brain function
MethodsTest
