Soft Diamond Regularizers for Deep Learning

Olaoluwa Adigun; Bart Kosko

arXiv:2412.20724·stat.ML·September 15, 2025

Soft Diamond Regularizers for Deep Learning

Olaoluwa Adigun, Bart Kosko

PDF

Open Access

TL;DR

This paper introduces a new family of soft diamond regularizers based on alpha-stable distributions, which improve deep learning performance and sparsity, outperforming existing regularizers in image classification and language translation tasks.

Contribution

The paper proposes a novel soft diamond regularizer using alpha-stable distributions, addressing computational challenges with a look-up table, and demonstrates its effectiveness in deep learning applications.

Findings

01

Improved accuracy and sparsity in image classifiers.

02

Outperformed Laplacian regularizer in tests.

03

Effective in German-English translation tasks.

Abstract

This chapter presents the new family of soft diamond synaptic regularizers based on thick-tailed symmetric alpha stable $S α S$ probability bell curves. These new parametrized weight priors improved deep-learning performance on image and language-translation test sets and increased the sparsity of the trained weights. They outperformed the state-of-the-art hard-diamond Laplacian regularizer of sparse lasso regression and classification. The $S α S$ synaptic weight priors have power-law bell-curve tails that are thicker than the thin exponential tails of Gaussian bell curves that underly ridge regularizers. Their tails get thicker as the $α$ parameter decreases. These thicker tails model more impulsive behavior and allow for occasional distant search in synaptic weight spaces of extremely high dimension. The geometry of their constraint sets has a diamond shape. The shape…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsSparse Evolutionary Training