Insufficient Gibbs Sampling
Antoine Luciano, Christian P. Robert, Robin J. Ryder

TL;DR
This paper introduces a Gibbs sampling approach to estimate posterior distributions of parameters using only robust, aggregated statistics, addressing privacy constraints in data analysis.
Contribution
It proposes a novel method to perform Bayesian inference with limited data summaries, enabling posterior sampling and model comparison under privacy-preserving conditions.
Findings
Effective in toy examples and real income data
Allows estimation of Bayes factors from summary statistics
Highlights limitations and potential of the approach
Abstract
In some applied scenarios, the availability of complete data is restricted, often due to privacy concerns; only aggregated, robust and inefficient statistics derived from the data are made accessible. These robust statistics are not sufficient, but they demonstrate reduced sensitivity to outliers and offer enhanced data protection due to their higher breakdown point. We consider a parametric framework and propose a method to sample from the posterior distribution of parameters conditioned on various robust and inefficient statistics: specifically, the pairs (median, MAD) or (median, IQR), or a collection of quantiles. Our approach leverages a Gibbs sampler and simulates latent augmented data, which facilitates simulation from the posterior distribution of parameters belonging to specific families of distributions. A by-product of these samples from the joint posterior distribution of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Statistical Process Monitoring · Advanced Statistical Methods and Models
