Large sample scaling analysis of the Zig-Zag algorithm for Bayesian inference
Sanket Agrawal, Joris Bierkens, Gareth O. Roberts

TL;DR
This paper analyzes the large-sample behavior of the Zig-Zag process for Bayesian inference, showing how sub-sampling affects convergence and providing theoretical insights into its efficiency and scalability for big data applications.
Contribution
It offers a large sample scaling analysis of the Zig-Zag process, characterizing its transient and stationary phases, and demonstrates potential computational speed-ups with sub-sampling.
Findings
Zig-Zag trajectories are approximated by ODE solutions in the transient phase.
Stationary phase results include weak convergence of the process.
Sub-sampling with control variates achieves O(1) cost for large datasets.
Abstract
Piecewise deterministic Markov processes provide scalable methods for sampling from the posterior distributions in big data settings by admitting principled sub-sampling strategies that do not bias the output. An important example is the Zig-Zag process of [Ann. Stats. 47 (2019) 1288 - 1320] where clever sub-sampling has been shown to produce an essentially independent sample at a cost that does not scale with the size of the data. However, sub-sampling also leads to slower convergence and poor mixing of the process, a behaviour which questions the promised scalability of the algorithm. We provide a large sample scaling analysis of the Zig-Zag process and its sub-sampling versions in settings of parametric Bayesian inference. In the transient phase of the algorithm, we show that the Zig-Zag trajectories are well approximated by the solution to a system of ODEs. These ODEs possess a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Fault Detection and Control Systems · Target Tracking and Data Fusion in Sensor Networks
