# Sparseout: Controlling Sparsity in Deep Networks

**Authors:** Najeeb Khan, Ian Stavness

arXiv: 1904.08050 · 2019-04-18

## TL;DR

Sparseout is a novel regularization technique that explicitly controls activation sparsity in deep neural networks, improving performance in language modeling and image classification by adjusting sparsity levels.

## Contribution

It introduces Sparseout, a simple, efficient method that generalizes Dropout to regulate sparsity, with theoretical proof linking it to $L_q$ penalties and empirical validation across tasks.

## Key findings

- Sparsity benefits language modeling performance.
- Denser activations improve image classification.
- Sparseout effectively controls activation sparsity.

## Abstract

Dropout is commonly used to help reduce overfitting in deep neural networks. Sparsity is a potentially important property of neural networks, but is not explicitly controlled by Dropout-based regularization. In this work, we propose Sparseout a simple and efficient variant of Dropout that can be used to control the sparsity of the activations in a neural network. We theoretically prove that Sparseout is equivalent to an $L_q$ penalty on the features of a generalized linear model and that Dropout is a special case of Sparseout for neural networks. We empirically demonstrate that Sparseout is computationally inexpensive and is able to control the desired level of sparsity in the activations. We evaluated Sparseout on image classification and language modelling tasks to see the effect of sparsity on these tasks. We found that sparsity of the activations is favorable for language modelling performance while image classification benefits from denser activations. Sparseout provides a way to investigate sparsity in state-of-the-art deep learning models. Source code for Sparseout could be found at \url{https://github.com/najeebkhan/sparseout}.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.08050/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1904.08050/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/1904.08050/full.md

---
Source: https://tomesphere.com/paper/1904.08050