Efficient Learning of the Parameters of Non-Linear Models using   Differentiable Resampling in Particle Filters

Conor Rosato; Vincent Beraud; Paul Horridge; Thomas B. Sch\"on; Simon; Maskell

arXiv:2111.01409·stat.ML·August 10, 2022

Efficient Learning of the Parameters of Non-Linear Models using Differentiable Resampling in Particle Filters

Conor Rosato, Vincent Beraud, Paul Horridge, Thomas B. Sch\"on, Simon, Maskell

PDF

Open Access

TL;DR

This paper introduces a differentiable resampling technique for particle filters, enabling gradient-based parameter learning and improved MCMC sampling efficiency in state-space models.

Contribution

It extends the reparameterisation trick to resampling steps, allowing for differentiable particle filters and more effective parameter estimation using advanced MCMC methods.

Findings

01

NUTS improves chain mixing and accuracy.

02

Differentiable resampling enables gradient-based inference.

03

Enhanced efficiency in parameter estimation for state-space models.

Abstract

It has been widely documented that the sampling and resampling steps in particle filters cannot be differentiated. The {\itshape reparameterisation trick} was introduced to allow the sampling step to be reformulated into a differentiable function. We extend the {\itshape reparameterisation trick} to include the stochastic input to resampling therefore limiting the discontinuities in the gradient calculation after this step. Knowing the gradients of the prior and likelihood allows us to run particle Markov Chain Monte Carlo (p-MCMC) and use the No-U-Turn Sampler (NUTS) as the proposal when estimating parameters. We compare the Metropolis-adjusted Langevin algorithm (MALA), Hamiltonian Monte Carlo with different number of steps and NUTS. We consider two state-space models and show that NUTS improves the mixing of the Markov chain and can produce more accurate results in less…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTarget Tracking and Data Fusion in Sensor Networks · Markov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference