Kalman filter control in the reinforcement learning framework

Istvan Szita; Andras Lorincz

arXiv:cs/0301007·cs.LG·May 23, 2007

Kalman filter control in the reinforcement learning framework

Istvan Szita, Andras Lorincz

PDF

Open Access

TL;DR

This paper demonstrates how to adapt Kalman-filter models for reinforcement learning, enabling online optimal control estimation with a Hebbian learning rule for value updates, bridging control theory and learning algorithms.

Contribution

It introduces a modification to the linear-quadratic-Gaussian Kalman-filter model that allows online control estimation and integrates reinforcement learning principles.

Findings

01

Enables online estimation of optimal control using Kalman filters.

02

Introduces a Hebbian learning rule for value estimation.

03

Bridges Kalman filtering with reinforcement learning frameworks.

Abstract

There is a growing interest in using Kalman-filter models in brain modelling. In turn, it is of considerable importance to make Kalman-filters amenable for reinforcement learning. In the usual formulation of optimal control it is computed off-line by solving a backward recursion. In this technical note we show that slight modification of the linear-quadratic-Gaussian Kalman-filter model allows the on-line estimation of optimal control and makes the bridge to reinforcement learning. Moreover, the learning rule for value estimation assumes a Hebbian form weighted by the error of the value estimation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural dynamics and brain function · Motor Control and Adaptation · Cognitive Science and Mapping