An Adaptive Empirical Bayesian Method for Sparse Deep Learning

Wei Deng; Xiao Zhang; Faming Liang; Guang Lin

arXiv:1910.10791·stat.ML·April 15, 2020

An Adaptive Empirical Bayesian Method for Sparse Deep Learning

Wei Deng, Xiao Zhang, Faming Liang, Guang Lin

PDF

Open Access 1 Repo

TL;DR

This paper introduces an adaptive empirical Bayesian approach for sparse deep learning that employs self-adaptive spike-and-slab priors, combining stochastic gradient MCMC and stochastic approximation for hyperparameter optimization, achieving state-of-the-art results and enhanced robustness.

Contribution

It presents a novel adaptive Bayesian framework with convergence guarantees, improving sparsity, performance, and adversarial resistance in deep neural networks.

Findings

01

State-of-the-art accuracy on MNIST and Fashion MNIST

02

Superior compression on CIFAR10 with Residual Networks

03

Enhanced resistance to adversarial attacks

Abstract

We propose a novel adaptive empirical Bayesian method for sparse deep learning, where the sparsity is ensured via a class of self-adaptive spike-and-slab priors. The proposed method works by alternatively sampling from an adaptive hierarchical posterior distribution using stochastic gradient Markov Chain Monte Carlo (MCMC) and smoothly optimizing the hyperparameters using stochastic approximation (SA). We further prove the convergence of the proposed method to the asymptotically correct distribution under mild conditions. Empirical applications of the proposed method lead to the state-of-the-art performance on MNIST and Fashion MNIST with shallow convolutional neural networks and the state-of-the-art compression performance on CIFAR10 with Residual Networks. The proposed method also improves resistance to adversarial attacks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

WayneDW/Bayesian-Sparse-Deep-Learning
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Gaussian Processes and Bayesian Inference