Efficient Learning of Deep State Space Models via Importance Smoothing

John-Joseph Brady; Nikolas Nusken; Yunpeng Li

arXiv:2605.21108·cs.LG·May 21, 2026

Efficient Learning of Deep State Space Models via Importance Smoothing

John-Joseph Brady, Nikolas Nusken, Yunpeng Li

PDF

TL;DR

This paper introduces a novel parallel variational Monte Carlo method for efficiently training deep state space models, achieving faster training and improved results over existing approaches.

Contribution

The authors propose a new parallel training method that combines auto-encoding and sequential Monte Carlo techniques for scalable deep state space models.

Findings

01

Achieves state-of-the-art or better results on baseline experiments.

02

Trains 10 times faster than the fastest existing SMC approach.

03

Robustly trains DSSMs for both discriminative and generative tasks.

Abstract

Latent state space systems are ubiquitous in statistical modelling, arising naturally when a time series is observed through a noisy measurement function, however training deep state space models (DSSM) at scale remains difficult. Two largely distinct strategies and literatures have developed around the training of DSSMs. Firstly, auto-encoding DSSMs train generative DSSMs by optimising a variational lower bound. Secondly, DSSMs trained by back-propagating the outputs of a classical sequential Monte Carlo algorithm (SMC). Such approaches can train DSSMs for discriminative as well as generative tasks, however, due to the sequentiality of their forward pass, scale poorly on modern hardware. We propose a new training method \emph{parallel variational Monte Carlo} (PVMC) that bridges the gap between the paradigms, and can be used robustly to train DSSMs for both discriminative and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.