Expectation Propagation for Neural Networks with Sparsity-promoting Priors
Pasi Jyl\"anki, Aapo Nummenmaa, Aki Vehtari

TL;DR
This paper introduces an expectation propagation method for nonlinear regression with neural networks that incorporate sparsity-promoting hierarchical priors, enabling efficient Bayesian inference and flexible model structures.
Contribution
It develops a computationally efficient EP-based algorithm for neural networks with hierarchical sparsity priors, extending flexibility in prior choices and model structures.
Findings
The approach performs well on simulated data.
It achieves competitive results on real-world datasets.
The method scales similarly to sparse linear models.
Abstract
We propose a novel approach for nonlinear regression using a two-layer neural network (NN) model structure with sparsity-favoring hierarchical priors on the network weights. We present an expectation propagation (EP) approach for approximate integration over the posterior distribution of the weights, the hierarchical scale parameters of the priors, and the residual scale. Using a factorized posterior approximation we derive a computationally efficient algorithm, whose complexity scales similarly to an ensemble of independent sparse linear models. The approach enables flexible definition of weight priors with different sparseness properties such as independent Laplace priors with a common scale parameter or Gaussian automatic relevance determination (ARD) priors with different relevance parameters for all inputs. The approach can be extended beyond standard activation functions and NN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Statistical Methods and Inference · Sparse and Compressive Sensing Techniques
MethodsGaussian Process
