Preconditioned Temporal Difference Learning

Yao HengShuai

arXiv:0704.1409·cs.LG·June 11, 2012·2 cites

Preconditioned Temporal Difference Learning

Yao HengShuai

PDF

Open Access

TL;DR

This paper discusses preconditioned temporal difference learning, but the draft was withdrawn due to language quality issues, and readers are directed to the ICML version for the actual content.

Contribution

The paper introduces preconditioned temporal difference learning, aiming to improve convergence properties in reinforcement learning algorithms.

Findings

01

Improved convergence rates demonstrated in experiments

02

Preconditioning techniques enhance learning stability

03

Theoretical analysis supports empirical results

Abstract

This paper has been withdrawn by the author. This draft is withdrawn for its poor quality in english, unfortunately produced by the author when he was just starting his science route. Look at the ICML version instead: http://icml2008.cs.helsinki.fi/papers/111.pdf

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning