Learning the Kalman Filter with Fine-Grained Sample Complexity

Xiangyuan Zhang; Bin Hu; Tamer Ba\c{s}ar

arXiv:2301.12624·math.OC·March 1, 2023

Learning the Kalman Filter with Fine-Grained Sample Complexity

Xiangyuan Zhang, Bin Hu, Tamer Ba\c{s}ar

PDF

Open Access

TL;DR

This paper introduces a model-free policy gradient method for Kalman filtering, achieving near-optimal sample complexity without prior system stability assumptions, advancing control in noisy and disturbed linear systems.

Contribution

It presents the first end-to-end sample complexity analysis for model-free policy gradient methods in Kalman filtering, introducing the receding-horizon framework without requiring system stability or prior filters.

Findings

01

Achieves $ ilde{\mathcal{O}}(\epsilon^{-2})$ sample complexity for learning stabilizing filters.

02

Does not require open-loop stability or prior stabilizing filters.

03

Applicable to systems with noisy and adversarial disturbances.

Abstract

We develop the first end-to-end sample complexity of model-free policy gradient (PG) methods in discrete-time infinite-horizon Kalman filtering. Specifically, we introduce the receding-horizon policy gradient (RHPG-KF) framework and demonstrate $\tilde{O} (ϵ^{- 2})$ sample complexity for RHPG-KF in learning a stabilizing filter that is $ϵ$ -close to the optimal Kalman filter. Notably, the proposed RHPG-KF framework does not require the system to be open-loop stable nor assume any prior knowledge of a stabilizing filter. Our results shed light on applying model-free PG methods to control a linear dynamical system where the state measurements could be corrupted by statistical noises and other (possibly adversarial) disturbances.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Target Tracking and Data Fusion in Sensor Networks · Gaussian Processes and Bayesian Inference