Adaptive Bases for Reinforcement Learning

Dotan Di Castro; Shie Mannor

arXiv:1005.0125·cs.LG·May 4, 2010·1 cites

Adaptive Bases for Reinforcement Learning

Dotan Di Castro, Shie Mannor

PDF

Open Access

TL;DR

This paper introduces an adaptive basis method for reinforcement learning with function approximation, dynamically adjusting the basis to improve value function accuracy, convergence, and performance in simulated environments.

Contribution

It proposes a novel adaptive basis approach within the actor-critic framework that converges and enhances reinforcement learning performance.

Findings

01

Adaptive basis improves value function approximation.

02

Algorithms converge under the proposed framework.

03

Demonstrated performance gains in simulations.

Abstract

We consider the problem of reinforcement learning using function approximation, where the approximating basis can change dynamically while interacting with the environment. A motivation for such an approach is maximizing the value function fitness to the problem faced. Three errors are considered: approximation square error, Bellman residual, and projected Bellman residual. Algorithms under the actor-critic framework are presented, and shown to converge. The advantage of such an adaptive basis is demonstrated in simulations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Adaptive Dynamic Programming Control