On Divergence Measures for Bayesian Pseudocoresets

Balhae Kim; Jungwon Choi; Seanie Lee; Yoonho Lee; Jung-Woo Ha; Juho; Lee

arXiv:2210.06205·cs.LG·October 13, 2022

On Divergence Measures for Bayesian Pseudocoresets

Balhae Kim, Jungwon Choi, Seanie Lee, Yoonho Lee, Jung-Woo Ha, Juho, Lee

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper explores divergence measures for constructing Bayesian pseudocoresets, proposing a new algorithm based on forward KL divergence, and demonstrates their effectiveness in high-dimensional Bayesian inference tasks.

Contribution

It unifies divergence-based approaches for Bayesian pseudocoreset construction and introduces a novel method using forward KL divergence.

Findings

01

Pseudocoresets reflect the true posterior in high-dimensional problems

02

Existing divergence measures can be interpreted within a unified framework

03

The new forward KL-based method performs well in empirical tests

Abstract

A Bayesian pseudocoreset is a small synthetic dataset for which the posterior over parameters approximates that of the original dataset. While promising, the scalability of Bayesian pseudocoresets is not yet validated in realistic problems such as image classification with deep neural networks. On the other hand, dataset distillation methods similarly construct a small dataset such that the optimization using the synthetic dataset converges to a solution with performance competitive with optimization using full data. Although dataset distillation has been empirically verified in large-scale settings, the framework is restricted to point estimates, and their adaptation to Bayesian inference has not been explored. This paper casts two representative dataset distillation algorithms as approximations to methods for constructing pseudocoresets by minimizing specific divergence measures:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

balhaekim/bpc-divergences
pytorchOfficial

Videos

On Divergence Measures for Bayesian Pseudocoresets· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification