Whiteout: Gaussian Adaptive Noise Regularization in Deep Neural Networks

Yinan Li; Fang Liu

arXiv:1612.01490·stat.ML·June 28, 2021

Whiteout: Gaussian Adaptive Noise Regularization in Deep Neural Networks

Yinan Li, Fang Liu

PDF

TL;DR

Whiteout introduces a novel Gaussian noise regularization technique for deep neural networks that promotes sparsity and stabilizes training without relying on traditional $l_2$ regularization, demonstrating superior performance especially on small datasets.

Contribution

It is the first to thoroughly analyze and develop Gaussian noise-based regularization for deep NNs, extending to adaptive lasso and group lasso, with theoretical and empirical validation.

Findings

01

Whiteout stabilizes neural network training.

02

Whiteout outperforms Bernoulli NIRTs, dropout, and shakeout on small datasets.

03

Whiteout effectively induces $l_{eta}$ sparsity regularization.

Abstract

Noise injection (NI) is an efficient technique to mitigate over-fitting in neural networks (NNs). The Bernoulli NI procedure as implemented in dropout and shakeout has connections with $l_{1}$ and $l_{2}$ regularization for the NN model parameters. We propose whiteout, a family NI regularization techniques (NIRT) through injecting adaptive Gaussian noises during the training of NNs. Whiteout is the first NIRT than imposes a broad range of the $l_{γ}$ sparsity regularization $(γ \in (0, 2))$ without having to involving the $l_{2}$ regularization. Whiteout can also be extended to offer regularizations similar to the adaptive lasso and group lasso. We establish the regularization effect of whiteout in the framework of generalized linear models with closed-form penalty terms and show that whiteout stabilizes the training of NNs with decreased sensitivity to small perturbations in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDropout