An Adaptive Empirical Bayesian Method for Sparse Deep Learning
Wei Deng, Xiao Zhang, Faming Liang, Guang Lin

TL;DR
This paper introduces an adaptive empirical Bayesian approach for sparse deep learning that employs self-adaptive spike-and-slab priors, combining stochastic gradient MCMC and stochastic approximation for hyperparameter optimization, achieving state-of-the-art results and enhanced robustness.
Contribution
It presents a novel adaptive Bayesian framework with convergence guarantees, improving sparsity, performance, and adversarial resistance in deep neural networks.
Findings
State-of-the-art accuracy on MNIST and Fashion MNIST
Superior compression on CIFAR10 with Residual Networks
Enhanced resistance to adversarial attacks
Abstract
We propose a novel adaptive empirical Bayesian method for sparse deep learning, where the sparsity is ensured via a class of self-adaptive spike-and-slab priors. The proposed method works by alternatively sampling from an adaptive hierarchical posterior distribution using stochastic gradient Markov Chain Monte Carlo (MCMC) and smoothly optimizing the hyperparameters using stochastic approximation (SA). We further prove the convergence of the proposed method to the asymptotically correct distribution under mild conditions. Empirical applications of the proposed method lead to the state-of-the-art performance on MNIST and Fashion MNIST with shallow convolutional neural networks and the state-of-the-art compression performance on CIFAR10 with Residual Networks. The proposed method also improves resistance to adversarial attacks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Gaussian Processes and Bayesian Inference
