A Novel Sparse Regularizer

Hovig Tigran Bayandorian

arXiv:2301.07285·cs.LG·April 24, 2023

TL;DR

This paper introduces an entropy-based regularizer that depends on the spatial arrangement of weights within a weight matrix, and reports roughly a tenfold reduction in nonzero parameters for LeNet300 on MNIST.

Contribution

A novel entropy-based regularizer that, unlike traditional norm-based methods, depends on the spatial arrangement of weights rather than on each weight in isolation. It is simple and fast to compute.

Findings

01 Achieves a roughly tenfold reduction in nonzero parameters for LeNet300 on MNIST.

02 The regularizer is differentiable, scalable, and easy to parallelize.

03 Provides better sparsity-accuracy trade-offs than existing regularizers.

Abstract

$L_{p}$-norm regularization schemes such as $L_{0}$, $L_{1}$, and $L_{2}$-norm regularization and $L_{p}$-norm-based regularization techniques such as weight decay, LASSO, and elastic net compute a quantity which depends on model weights considered in isolation from one another. This paper introduces a regularizer based on minimizing a novel measure of entropy applied to the model during optimization. In contrast with $L_{p}$-norm-based regularization, this regularizer is concerned with the spatial arrangement of weights within a weight matrix. This novel regularizer is an additive term for the loss function and is differentiable, simple and fast to compute, scale-invariant, requires a trivial amount of additional memory, and can easily be parallelized. Empirically this method yields approximately a one order-of-magnitude improvement in the number of nonzero model parameters required to achieve a…
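
The abstract's opening observation, that norm-based penalties treat each weight in isolation, can be illustrated with a minimal sketch. This is not code from the paper, and it does not implement the paper's entropy measure, which this summary does not specify. The helper names (`l1_penalty`, `l2_penalty`, `elastic_net_penalty`, `shuffle_entries`), the mixing parameter `alpha`, and the example matrix `W` are all illustrative assumptions. The sketch only shows that shuffling the positions of the entries of a weight matrix leaves these penalties unchanged, so they cannot distinguish one spatial arrangement from another.

```python
import math
import random


def l1_penalty(weights):
    """LASSO-style penalty: sum of absolute values of all entries."""
    return sum(abs(w) for row in weights for w in row)


def l2_penalty(weights):
    """Weight-decay-style penalty: sum of squares of all entries."""
    return sum(w * w for row in weights for w in row)


def elastic_net_penalty(weights, alpha=0.5):
    """Convex mix of the L1 and squared-L2 penalties (alpha is hypothetical)."""
    return alpha * l1_penalty(weights) + (1.0 - alpha) * l2_penalty(weights)


def shuffle_entries(weights, seed=0):
    """Return a matrix of the same shape with the same entries in shuffled positions."""
    rng = random.Random(seed)
    flat = [w for row in weights for w in row]
    rng.shuffle(flat)
    n_cols = len(weights[0])
    return [flat[i:i + n_cols] for i in range(0, len(flat), n_cols)]


# Illustrative 3x4 weight matrix with three nonzero entries.
W = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 2.0, 0.0, 0.0],
    [0.0, 0.0, -3.0, 0.0],
]
W_shuffled = shuffle_entries(W)

# Each penalty is a sum over individual entries, so rearranging them changes nothing.
for penalty in (l1_penalty, l2_penalty, elastic_net_penalty):
    assert math.isclose(penalty(W), penalty(W_shuffled))
    print(penalty.__name__, penalty(W), penalty(W_shuffled))
```

A regularizer that is sensitive to spatial arrangement, as the abstract describes, would in general assign different values to `W` and `W_shuffled`; the norm-based penalties above cannot.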

Taxonomy

Topics: Sparse and Compressive Sensing Techniques · Neural Networks and Applications · Image and Signal Denoising Methods

Methods: Test · Weight Decay