An Equivalence between Bayesian Priors and Penalties in Variational Inference

Pierre Wolinski; Guillaume Charpiat; Yann Ollivier

arXiv:2002.00178 · cs.LG · February 8, 2024 · 1 citation

Open Access

TL;DR

This paper establishes a formal connection between Bayesian priors and regularization penalties in variational inference, providing a systematic method to identify the prior from a given penalty function.

Contribution

It characterizes which regularizers correspond to Bayesian priors in variational inference and offers a way to compute the prior from a specified penalty.

Findings

01. Characterization of the regularizers that correspond to Bayesian priors in variational inference.

02. A systematic method to compute the prior from a given penalty function.

03. Constraints on the penalty function under which the overall procedure remains Bayesian.

Abstract

In machine learning, it is common to optimize the parameters of a probabilistic model, modulated by an ad hoc regularization term that penalizes some values of the parameters. Regularization terms appear naturally in Variational Inference, a tractable way to approximate Bayesian posteriors: the loss to optimize contains a Kullback-Leibler divergence term between the approximate posterior and a Bayesian prior. We fully characterize the regularizers that can arise according to this procedure, and provide a systematic way to compute the prior corresponding to a given penalty. Such a characterization can be used to discover constraints over the penalty function, so that the overall procedure remains Bayesian.
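To make the role of the Kullback-Leibler term concrete, here is a minimal sketch of how it acts as a penalty on the parameters of the approximate posterior. It assumes a Gaussian approximate posterior N(mu, sigma^2) and a zero-mean Gaussian prior N(0, prior_sigma^2) on each scalar parameter. These choices, and the names `gaussian_kl_penalty` and `variational_loss`, are illustrative assumptions and do not come from the paper. The sketch shows only the forward direction (prior to penalty), not the paper's procedure for recovering a prior from a given penalty.

```python
import math


def gaussian_kl_penalty(mu, sigma, prior_sigma=1.0):
    """KL( N(mu, sigma^2) || N(0, prior_sigma^2) ) for one scalar parameter.

    Assumption: both distributions are Gaussian, so the divergence has
    the standard closed form used below.
    """
    if sigma <= 0 or prior_sigma <= 0:
        raise ValueError("standard deviations must be positive")
    return (
        math.log(prior_sigma / sigma)
        + (sigma ** 2 + mu ** 2) / (2.0 * prior_sigma ** 2)
        - 0.5
    )


def variational_loss(expected_nll, mus, sigmas, prior_sigma=1.0):
    """Data-fit term plus the summed KL penalty over all parameters.

    expected_nll is a hypothetical, precomputed expected negative
    log-likelihood under the approximate posterior.
    """
    kl = sum(
        gaussian_kl_penalty(m, s, prior_sigma) for m, s in zip(mus, sigmas)
    )
    return expected_nll + kl
```

Under these assumptions the penalty vanishes exactly when the approximate posterior equals the prior, and it grows quadratically in mu, which is the familiar L2-style behavior of a Gaussian prior.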

Taxonomy

Topics: Gaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms · Statistical Methods and Inference