Multilevel Sampling in Algebraic Statistics
Nathan Kirk, Ivan Gvozdanovi\'c, Sonja Petrovi\'c

TL;DR
This paper introduces a multilevel sampling algorithm for fiber sampling in algebraic statistics, improving efficiency and sample quality assessment through novel metrics and outperforming traditional methods in benchmarks.
Contribution
The paper develops a multilevel sampling algorithm tailored for algebraic statistics, incorporating variable step sizes and new quality metrics like FCS and MMD.
Findings
Multilevel sampling outperforms naive MCMC in benchmarks.
FCS and MMD effectively assess sample quality.
Algorithm accelerates exploration and reduces burn-in.
Abstract
This paper proposes a multilevel sampling algorithm for fiber sampling problems in algebraic statistics, inspired by Henry Wynn's suggestion to adapt multilevel Monte Carlo (MLMC) ideas to discrete models. Focusing on log-linear models, we sample from high-dimensional lattice fibers defined by algebraic constraints. Building on Markov basis methods and results from Diaconis and Sturmfels, our algorithm uses variable step sizes to accelerate exploration and reduce the need for long burn-in. We introduce a novel Fiber Coverage Score (FCS) based on Voronoi partitioning to assess sample quality, and highlight the utility of the Maximum Mean Discrepancy (MMD) quality metric. Simulations on benchmark fibers show that multilevel sampling outperforms naive MCMC approaches. Our results demonstrate that multilevel methods, when properly applied, provide practical benefits for discrete sampling in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Statistical Methods and Inference · Stochastic Gradient Optimization Techniques
