Data-efficient visuomotor policy training using reinforcement learning   and generative models

Ali Ghadirzadeh; Petra Poklukar; Ville Kyrki; Danica Kragic and; M{\aa}rten Bj\"orkman

arXiv:2007.13134·cs.RO·November 9, 2020·5 cites

Data-efficient visuomotor policy training using reinforcement learning and generative models

Ali Ghadirzadeh, Petra Poklukar, Ville Kyrki, Danica Kragic and, M{\aa}rten Bj\"orkman

PDF

Open Access

TL;DR

This paper introduces a data-efficient framework combining reinforcement learning and generative models for visuomotor tasks, improving policy training efficiency and safety in robotic applications.

Contribution

It proposes a novel three-part training approach integrating latent variables, generative models, and supervised learning for visuomotor policies.

Findings

01

Enables safe exploration and reduces data requirements.

02

Provides measures to predict RL policy performance before physical training.

03

Analyzes generative model characteristics affecting policy success.

Abstract

We present a data-efficient framework for solving visuomotor sequential decision-making problems which exploits the combination of reinforcement learning (RL) and latent variable generative models. Our framework trains deep visuomotor policies by introducing an action latent variable such that the feed-forward policy search can be divided into three parts: (i) training a sub-policy that outputs a distribution over the action latent variable given a state of the system, (ii) unsupervised training of a generative model that outputs a sequence of motor actions conditioned on the latent action variable, and (iii) supervised training of the deep visuomotor policy in an end-to-end fashion. Our approach enables safe exploration and alleviates the data-inefficiency problem as it exploits prior knowledge about valid sequences of motor actions. Moreover, we provide a set of measures for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Autonomous Vehicle Technology and Safety