Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep   Feature Spaces

Philipp Becker; Harit Pandya; Gregor Gebhardt; Cheng Zhao; James; Taylor; Gerhard Neumann

arXiv:1905.07357·cs.LG·May 20, 2019·30 cites

Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces

Philipp Becker, Harit Pandya, Gregor Gebhardt, Cheng Zhao, James, Taylor, Gerhard Neumann

PDF

Open Access 3 Repos

TL;DR

Recurrent Kalman Networks (RKNs) introduce a scalable, end-to-end trainable deep filtering approach that explicitly models uncertainty in high-dimensional time-series data, outperforming traditional RNNs in uncertainty estimation and prediction accuracy.

Contribution

The paper presents RKNs, a novel deep filtering architecture that simplifies Kalman updates in high-dimensional spaces, enabling direct end-to-end training without approximations.

Findings

01

RKNs provide more accurate uncertainty estimates than LSTM and GRU.

02

RKNs achieve slightly better prediction performance.

03

RKNs outperform recent generative models on image imputation.

Abstract

In order to integrate uncertainty estimates into deep time-series modelling, Kalman Filters (KFs) (Kalman et al., 1960) have been integrated with deep learning models, however, such approaches typically rely on approximate inference techniques such as variational inference which makes learning more complex and often less scalable due to approximation errors. We propose a new deep approach to Kalman filtering which can be learned directly in an end-to-end manner using backpropagation without additional approximations. Our approach uses a high-dimensional factorized latent state representation for which the Kalman updates simplify to scalar operations and thus avoids hard to backpropagate, computationally heavy and potentially unstable matrix inversions. Moreover, we use locally linear dynamic models to efficiently propagate the latent state to the next time step. The resulting network…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference · Statistical and numerical algorithms

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory