Approximate Bayesian computation with the Wasserstein distance
Espen Bernton (Harvard University), Pierre E. Jacob (Harvard, University), Mathieu Gerber (University of Bristol), Christian P. Robert, (Universit\'e Paris-Dauphine, PSL, University of Warwick)

TL;DR
This paper introduces a likelihood-free Bayesian inference method using the Wasserstein distance to compare empirical distributions, enabling scalable and information-preserving analysis of complex generative models.
Contribution
It generalizes ABC by replacing summary statistics with Wasserstein distance, including a new Hilbert curve-based distance for improved scalability and theoretical guarantees.
Findings
Method performs well on univariate and multivariate examples
Scales to realistic data sizes with recent Wasserstein approximations
Provides theoretical consistency and concentration results
Abstract
A growing number of generative statistical models do not permit the numerical evaluation of their likelihood functions. Approximate Bayesian computation (ABC) has become a popular approach to overcome this issue, in which one simulates synthetic data sets given parameters and compares summaries of these data sets with the corresponding observed values. We propose to avoid the use of summaries and the ensuing loss of information by instead using the Wasserstein distance between the empirical distributions of the observed and synthetic data. This generalizes the well-known approach of using order statistics within ABC to arbitrary dimensions. We describe how recently developed approximations of the Wasserstein distance allow the method to scale to realistic data sizes, and propose a new distance based on the Hilbert space-filling curve. We provide a theoretical study of the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Bayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference
