Whiteout: Gaussian Adaptive Noise Regularization in Deep Neural Networks
Yinan Li, Fang Liu

TL;DR
Whiteout introduces a novel Gaussian noise regularization technique for deep neural networks that promotes sparsity and stabilizes training without relying on traditional $l_2$ regularization, demonstrating superior performance especially on small datasets.
Contribution
It is the first to thoroughly analyze and develop Gaussian noise-based regularization for deep NNs, extending to adaptive lasso and group lasso, with theoretical and empirical validation.
Findings
Whiteout stabilizes neural network training.
Whiteout outperforms Bernoulli NIRTs, dropout, and shakeout on small datasets.
Whiteout effectively induces $l_{eta}$ sparsity regularization.
Abstract
Noise injection (NI) is an efficient technique to mitigate over-fitting in neural networks (NNs). The Bernoulli NI procedure as implemented in dropout and shakeout has connections with and regularization for the NN model parameters. We propose whiteout, a family NI regularization techniques (NIRT) through injecting adaptive Gaussian noises during the training of NNs. Whiteout is the first NIRT than imposes a broad range of the sparsity regularization without having to involving the regularization. Whiteout can also be extended to offer regularizations similar to the adaptive lasso and group lasso. We establish the regularization effect of whiteout in the framework of generalized linear models with closed-form penalty terms and show that whiteout stabilizes the training of NNs with decreased sensitivity to small perturbations in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDropout
