State and parameter learning with PaRIS particle Gibbs
Gabriel Cardoso, Yazid Janati El Idrissi, Sylvain Le Corff, Eric, Moulines, Jimmy Olsson

TL;DR
This paper introduces the Parisian particle Gibbs (PPG) sampler, a novel bias-reduced smoothing algorithm for non-linear state-space models, with theoretical guarantees and applications to likelihood estimation and score climbing.
Contribution
The paper proposes the PPG sampler, combining PaRIS with conditional SMC, providing bias reduction, theoretical bounds, and applying it to learning tasks like MLE and MSC.
Findings
PPG reduces bias compared to standard PaRIS.
Theoretical bounds on bias, variance, and deviation established.
Numerical experiments confirm the effectiveness of PPG.
Abstract
Non-linear state-space models, also known as general hidden Markov models, are ubiquitous in statistical machine learning, being the most classical generative models for serial data and sequences in general. The particle-based, rapid incremental smoother PaRIS is a sequential Monte Carlo (SMC) technique allowing for efficient online approximation of expectations of additive functionals under the smoothing distribution in these models. Such expectations appear naturally in several learning contexts, such as likelihood estimation (MLE) and Markov score climbing (MSC). PARIS has linear computational complexity, limited memory requirements and comes with non-asymptotic bounds, convergence results and stability guarantees. Still, being based on self-normalised importance sampling, the PaRIS estimator is biased. Our first contribution is to design a novel additive smoothing algorithm, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference · Target Tracking and Data Fusion in Sensor Networks
