Data-efficient visuomotor policy training using reinforcement learning and generative models
Ali Ghadirzadeh, Petra Poklukar, Ville Kyrki, Danica Kragic and, M{\aa}rten Bj\"orkman

TL;DR
This paper introduces a data-efficient framework combining reinforcement learning and generative models for visuomotor tasks, improving policy training efficiency and safety in robotic applications.
Contribution
It proposes a novel three-part training approach integrating latent variables, generative models, and supervised learning for visuomotor policies.
Findings
Enables safe exploration and reduces data requirements.
Provides measures to predict RL policy performance before physical training.
Analyzes generative model characteristics affecting policy success.
Abstract
We present a data-efficient framework for solving visuomotor sequential decision-making problems which exploits the combination of reinforcement learning (RL) and latent variable generative models. Our framework trains deep visuomotor policies by introducing an action latent variable such that the feed-forward policy search can be divided into three parts: (i) training a sub-policy that outputs a distribution over the action latent variable given a state of the system, (ii) unsupervised training of a generative model that outputs a sequence of motor actions conditioned on the latent action variable, and (iii) supervised training of the deep visuomotor policy in an end-to-end fashion. Our approach enables safe exploration and alleviates the data-inefficiency problem as it exploits prior knowledge about valid sequences of motor actions. Moreover, we provide a set of measures for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Autonomous Vehicle Technology and Safety
