General bounds on the quality of Bayesian coresets

Trevor Campbell

arXiv:2405.11780·stat.ML·October 18, 2024·1 cites

General bounds on the quality of Bayesian coresets

Trevor Campbell

PDF

Open Access 1 Video

TL;DR

This paper establishes general theoretical bounds on the approximation error of Bayesian coresets, broadening understanding of their limitations and performance across diverse models, including complex and heavy-tailed distributions.

Contribution

It provides the first general upper and lower bounds on coreset approximation quality applicable to a wide range of Bayesian models, beyond restrictive assumptions.

Findings

01

Lower bounds reveal fundamental limitations of coreset approximations.

02

Upper bounds analyze performance of recent subsample-optimize methods.

03

Experimental validation on complex Bayesian posteriors demonstrates theory's applicability.

Abstract

Bayesian coresets speed up posterior inference in the large-scale data regime by approximating the full-data log-likelihood function with a surrogate log-likelihood based on a small, weighted subset of the data. But while Bayesian coresets and methods for construction are applicable in a wide range of models, existing theoretical analysis of the posterior inferential error incurred by coreset approximations only apply in restrictive settings -- i.e., exponential family models, or models with strong log-concavity and smoothness assumptions. This work presents general upper and lower bounds on the Kullback-Leibler (KL) divergence of coreset approximations that reflect the full range of applicability of Bayesian coresets. The lower bounds require only mild model assumptions typical of Bayesian asymptotic analyses, while the upper bounds require the log-likelihood functions to satisfy a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

General bounds on the quality of Bayesian coresets· slideslive

Taxonomy

TopicsStatistical Methods and Inference · Bayesian Modeling and Causal Inference · Advanced Bandit Algorithms Research

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Coresets