Online Learning of the Kalman Filter with Logarithmic Regret

Anastasios Tsiamis; George Pappas

arXiv:2002.05141·cs.LG·February 13, 2020·1 cites

Online Learning of the Kalman Filter with Logarithmic Regret

Anastasios Tsiamis, George Pappas

PDF

Open Access

TL;DR

This paper introduces an online learning algorithm for the Kalman filter that guarantees logarithmic regret in prediction accuracy, even for non-explosive systems, advancing the theoretical understanding of adaptive filtering.

Contribution

It provides the first logarithmic regret guarantees for the Kalman filter in an online setting, applicable to non-explosive and marginally stable systems.

Findings

01

Achieves polylogarithmic regret with high probability.

02

Applicable to systems with unknown noise statistics.

03

Extends analysis to non-explosive, marginally stable systems.

Abstract

In this paper, we consider the problem of predicting observations generated online by an unknown, partially observed linear system, which is driven by stochastic noise. For such systems the optimal predictor in the mean square sense is the celebrated Kalman filter, which can be explicitly computed when the system model is known. When the system model is unknown, we have to learn how to predict observations online based on finite data, suffering possibly a non-zero regret with respect to the Kalman filter's prediction. We show that it is possible to achieve a regret of the order of $poly lo g (N)$ with high probability, where $N$ is the number of observations collected. Our work is the first to provide logarithmic regret guarantees for the widely used Kalman filter. This is achieved using an online least-squares algorithm, which exploits the approximately linear relation between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms · Gaussian Processes and Bayesian Inference