Online Learning of the Kalman Filter with Logarithmic Regret
Anastasios Tsiamis, George Pappas

TL;DR
This paper introduces an online learning algorithm for the Kalman filter that guarantees logarithmic regret in prediction accuracy, even for non-explosive systems, advancing the theoretical understanding of adaptive filtering.
Contribution
It provides the first logarithmic regret guarantees for the Kalman filter in an online setting, applicable to non-explosive and marginally stable systems.
Findings
Achieves polylogarithmic regret with high probability.
Applicable to systems with unknown noise statistics.
Extends analysis to non-explosive, marginally stable systems.
Abstract
In this paper, we consider the problem of predicting observations generated online by an unknown, partially observed linear system, which is driven by stochastic noise. For such systems the optimal predictor in the mean square sense is the celebrated Kalman filter, which can be explicitly computed when the system model is known. When the system model is unknown, we have to learn how to predict observations online based on finite data, suffering possibly a non-zero regret with respect to the Kalman filter's prediction. We show that it is possible to achieve a regret of the order of with high probability, where is the number of observations collected. Our work is the first to provide logarithmic regret guarantees for the widely used Kalman filter. This is achieved using an online least-squares algorithm, which exploits the approximately linear relation between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms · Gaussian Processes and Bayesian Inference
