A Novel Sparse Regularizer
Hovig Tigran Bayandorian

TL;DR
This paper introduces a new entropy-based regularizer that considers the spatial arrangement of weights, leading to significant sparsity improvements in neural network training.
Contribution
A novel entropy-based regularizer that differs from traditional norm-based methods by focusing on weight spatial distribution, offering simplicity and efficiency.
Findings
Achieves roughly tenfold reduction in nonzero parameters for LeNet300 on MNIST.
Is differentiable, scalable, and easy to implement in parallel.
Provides better sparsity-accuracy trade-offs compared to existing regularizers.
Abstract
-norm regularization schemes such as , , and -norm regularization and -norm-based regularization techniques such as weight decay, LASSO, and elastic net compute a quantity which depends on model weights considered in isolation from one another. This paper introduces a regularizer based on minimizing a novel measure of entropy applied to the model during optimization. In contrast with -norm-based regularization, this regularizer is concerned with the spatial arrangement of weights within a weight matrix. This novel regularizer is an additive term for the loss function and is differentiable, simple and fast to compute, scale-invariant, requires a trivial amount of additional memory, and can easily be parallelized. Empirically this method yields approximately a one order-of-magnitude improvement in the number of nonzero model parameters required to achieve a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Neural Networks and Applications · Image and Signal Denoising Methods
MethodsTest · Weight Decay
