Temporal-Difference Networks for Dynamical Systems with Continuous   Observations and Actions

Christopher M. Vigorito

arXiv:1205.2608·cs.LG·May 14, 2012·2 cites

Temporal-Difference Networks for Dynamical Systems with Continuous Observations and Actions

Christopher M. Vigorito

PDF

Open Access

TL;DR

This paper introduces an incremental algorithm for learning predictive models of continuous dynamical systems using temporal-difference networks, extending previous finite-set methods to continuous observations and actions.

Contribution

It presents the first fully incremental method for TD networks applied to continuous dynamical systems, enabling robust modeling of noisy, real-world systems.

Findings

01

Successfully learned accurate models of noisy continuous systems

02

Demonstrated robustness of the algorithm in various scenarios

03

Extended TD networks to continuous observation and action spaces

Abstract

Temporal-difference (TD) networks are a class of predictive state representations that use well-established TD methods to learn models of partially observable dynamical systems. Previous research with TD networks has dealt only with dynamical systems with finite sets of observations and actions. We present an algorithm for learning TD network representations of dynamical systems with continuous observations and actions. Our results show that the algorithm is capable of learning accurate and robust models of several noisy continuous dynamical systems. The algorithm presented here is the first fully incremental method for learning a predictive representation of a continuous dynamical system.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Reinforcement Learning in Robotics · Neural Networks and Applications