Soft Diamond Regularizers for Deep Learning
Olaoluwa Adigun, Bart Kosko

TL;DR
This paper introduces a new family of soft diamond regularizers based on alpha-stable distributions, which improve deep learning performance and sparsity, outperforming existing regularizers in image classification and language translation tasks.
Contribution
The paper proposes a novel soft diamond regularizer using alpha-stable distributions, addressing computational challenges with a look-up table, and demonstrates its effectiveness in deep learning applications.
Findings
Improved accuracy and sparsity in image classifiers.
Outperformed Laplacian regularizer in tests.
Effective in German-English translation tasks.
Abstract
This chapter presents the new family of soft diamond synaptic regularizers based on thick-tailed symmetric alpha stable probability bell curves. These new parametrized weight priors improved deep-learning performance on image and language-translation test sets and increased the sparsity of the trained weights. They outperformed the state-of-the-art hard-diamond Laplacian regularizer of sparse lasso regression and classification. The synaptic weight priors have power-law bell-curve tails that are thicker than the thin exponential tails of Gaussian bell curves that underly ridge regularizers. Their tails get thicker as the parameter decreases. These thicker tails model more impulsive behavior and allow for occasional distant search in synaptic weight spaces of extremely high dimension. The geometry of their constraint sets has a diamond shape. The shape…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsSparse Evolutionary Training
