A Geometric View of Conjugate Priors

Arvind Agarwal; Hal Daume III

arXiv:1005.0047·cs.LG·May 4, 2010

A Geometric View of Conjugate Priors

Arvind Agarwal, Hal Daume III

PDF

Open Access

TL;DR

This paper provides a geometric interpretation of conjugate priors in Bayesian learning, framing them through Bregman divergence to enhance understanding and derive hyperparameters for hybrid models.

Contribution

It introduces a geometric perspective on conjugate priors using Bregman divergence, offering new insights and methods for hyperparameter derivation in semi-supervised learning.

Findings

01

Conjugate priors can be viewed as geometric entities defined by Bregman divergence.

02

Hyperparameters correspond to effective sample points in the geometric space.

03

The approach aids in deriving priors for hybrid generative-discriminative models.

Abstract

In Bayesian machine learning, conjugate priors are popular, mostly due to mathematical convenience. In this paper, we show that there are deeper reasons for choosing a conjugate prior. Specifically, we formulate the conjugate prior in the form of Bregman divergence and show that it is the inherent geometry of conjugate priors that makes them appropriate and intuitive. This geometric interpretation allows one to view the hyperparameters of conjugate priors as the {\it effective} sample points, thus providing additional intuition. We use this geometric understanding of conjugate priors to derive the hyperparameters and expression of the prior used to couple the generative and discriminative components of a hybrid model for semi-supervised learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Bayesian Modeling and Causal Inference · Bayesian Methods and Mixture Models