Continuous Value Iteration (CVI) Reinforcement Learning and Imaginary Experience Replay (IER) for learning multi-goal, continuous action and state space controllers

Andreas Gerken; Michael Spranger

arXiv:1908.10255 · cs.AI · August 28, 2019

TL;DR

This paper introduces a new model-free reinforcement learning algorithm that efficiently learns multi-goal controllers in continuous spaces, utilizing non-parametric value function approximation and a novel sample augmentation technique to enhance generalization and learning speed.

Contribution

It proposes a novel RL algorithm combining continuous value iteration and imaginary experience replay, improving multi-goal learning in continuous spaces with better generalization.

Findings

01 Faster learning in simulation and real-world robot tasks.

02 Effective multi-goal control in continuous action and state spaces.

03 Enhanced generalization through sample augmentation.

Abstract

This paper presents a novel model-free Reinforcement Learning algorithm for learning behavior in continuous action, state, and goal spaces. The algorithm approximates optimal value functions using non-parametric estimators. It is able to efficiently learn to reach multiple arbitrary goals in deterministic and nondeterministic environments. To improve generalization in the goal space, we propose a novel sample augmentation technique. Using these methods, robots learn controllers faster, and the learned controllers perform better overall. We benchmark the proposed algorithms using simulation and a real-world voltage-controlled robot that learns to maneuver in a non-observable Cartesian task space.
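The abstract names two ingredients: value-function approximation with a non-parametric estimator, and sample augmentation in the goal space. The sketch below illustrates one plausible reading of that combination and is not the authors' implementation. It assumes that augmentation means copying a recorded transition with a randomly sampled goal and recomputing its reward, and it swaps in a plain k-nearest-neighbour average as the non-parametric estimator. The function names, the sparse reward, the goal bounds, and every parameter value are hypothetical choices made for illustration. Action selection is left out entirely.

```python
import numpy as np

def sparse_reward(next_state, goal, tol=0.1):
    # Hypothetical reward: 1 when the next state lies within tol of the goal.
    return 1.0 if np.linalg.norm(next_state - goal) < tol else 0.0

def augment_with_imagined_goals(transitions, n_imagined, rng, low=-1.0, high=1.0):
    # Keep each recorded (state, action, next_state, goal) transition and add
    # n_imagined copies whose goal is drawn uniformly from [low, high).
    # The reward is recomputed for every goal, real or imagined.
    augmented = []
    for state, action, next_state, goal in transitions:
        goals = [goal] + [rng.uniform(low, high, size=goal.shape)
                          for _ in range(n_imagined)]
        for g in goals:
            augmented.append((state, action, next_state, g,
                              sparse_reward(next_state, g)))
    return augmented

def knn_value(query, points, values, k=3):
    # Non-parametric estimate: mean value of the k stored points nearest to query.
    dists = np.linalg.norm(points - query, axis=1)
    nearest = np.argsort(dists)[:k]
    return float(values[nearest].mean())

def backup_sweep(samples, values, gamma=0.9, k=3):
    # One value-iteration-style sweep over stored samples:
    # V(s, g) <- r + gamma * V_hat(s', g), with V_hat read from the kNN estimator.
    points = np.array([np.concatenate([s, g]) for s, _, _, g, _ in samples])
    new_values = np.empty(len(samples))
    for i, (_, _, s_next, g, r) in enumerate(samples):
        query = np.concatenate([s_next, g])
        new_values[i] = r + gamma * knn_value(query, points, values, k)
    return new_values

# Usage: one recorded transition that reaches its goal, plus four imagined goals.
rng = np.random.default_rng(0)
transitions = [(np.array([0.0]), np.array([0.5]), np.array([0.5]), np.array([0.5]))]
samples = augment_with_imagined_goals(transitions, n_imagined=4, rng=rng)
values = backup_sweep(samples, np.zeros(len(samples)), k=1)
```

In this toy setup a single recorded transition yields five training samples, so the estimator sees goals the robot never actually pursued. That is the intuition behind improving generalization in the goal space, under the assumptions stated above.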

Code & Models

Repositories

mauriciogtec/cvi (PyTorch)
