Efficient Learning of the Parameters of Non-Linear Models using Differentiable Resampling in Particle Filters
Conor Rosato, Vincent Beraud, Paul Horridge, Thomas B. Sch\"on, Simon, Maskell

TL;DR
This paper introduces a differentiable resampling technique for particle filters, enabling gradient-based parameter learning and improved MCMC sampling efficiency in state-space models.
Contribution
It extends the reparameterisation trick to resampling steps, allowing for differentiable particle filters and more effective parameter estimation using advanced MCMC methods.
Findings
NUTS improves chain mixing and accuracy.
Differentiable resampling enables gradient-based inference.
Enhanced efficiency in parameter estimation for state-space models.
Abstract
It has been widely documented that the sampling and resampling steps in particle filters cannot be differentiated. The {\itshape reparameterisation trick} was introduced to allow the sampling step to be reformulated into a differentiable function. We extend the {\itshape reparameterisation trick} to include the stochastic input to resampling therefore limiting the discontinuities in the gradient calculation after this step. Knowing the gradients of the prior and likelihood allows us to run particle Markov Chain Monte Carlo (p-MCMC) and use the No-U-Turn Sampler (NUTS) as the proposal when estimating parameters. We compare the Metropolis-adjusted Langevin algorithm (MALA), Hamiltonian Monte Carlo with different number of steps and NUTS. We consider two state-space models and show that NUTS improves the mixing of the Markov chain and can produce more accurate results in less…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTarget Tracking and Data Fusion in Sensor Networks · Markov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference
