
TL;DR
Neural Plasticity Networks (NPN) introduce a unified framework that dynamically prunes or expands neural networks during training, inspired by brain plasticity, and can interpolate between dropout and full training.
Contribution
This work presents the first end-to-end learning framework that unifies network sparsification and expansion based on $L_0$-norm regularization and stochastic binary gates.
Findings
NPN outperforms traditional methods on image classification benchmarks.
Both pruning and expansion yield compact models with competitive accuracy.
The framework seamlessly interpolates between dropout and full network training.
Abstract
Neural plasticity is an important functionality of human brain, in which number of neurons and synapses can shrink or expand in response to stimuli throughout the span of life. We model this dynamic learning process as an -norm regularized binary optimization problem, in which each unit of a neural network (e.g., weight, neuron or channel, etc.) is attached with a stochastic binary gate, whose parameters determine the level of activity of a unit in the network. At the beginning, only a small portion of binary gates (therefore the corresponding neurons) are activated, while the remaining neurons are in a hibernation mode. As the learning proceeds, some neurons might be activated or deactivated if doing so can be justified by the cost-benefit tradeoff measured by the -norm regularized objective. As the training gets mature, the probability of transition between activation and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural Networks and Applications · Machine Learning in Materials Science
MethodsDropout
