Data-efficient Deep Reinforcement Learning for Dexterous Manipulation

Ivaylo Popov; Nicolas Heess; Timothy Lillicrap; Roland Hafner; Gabriel; Barth-Maron; Matej Vecerik; Thomas Lampe; Yuval Tassa; Tom Erez; Martin; Riedmiller

arXiv:1704.03073·cs.LG·April 12, 2017·118 cites

Data-efficient Deep Reinforcement Learning for Dexterous Manipulation

Ivaylo Popov, Nicolas Heess, Timothy Lillicrap, Roland Hafner, Gabriel, Barth-Maron, Matej Vecerik, Thomas Lampe, Yuval Tassa, Tom Erez, Martin, Riedmiller

PDF

Open Access

TL;DR

This paper enhances deep reinforcement learning algorithms to improve data efficiency and scalability for dexterous robotic manipulation tasks, demonstrating successful object grasping and stacking in simulation with potential for real-world application.

Contribution

Introduces two extensions to the DDPG algorithm that significantly improve data efficiency and scalability for robotic manipulation tasks.

Findings

01

Enhanced DDPG with off-policy data improves manipulation success

02

Robust policies for grasping and stacking demonstrated in simulation

03

Potential for real robot training suggested

Abstract

Deep learning and reinforcement learning methods have recently been used to solve a variety of problems in continuous control domains. An obvious application of these techniques is dexterous manipulation tasks in robotics which are difficult to solve using traditional control theory or hand-engineered approaches. One example of such a task is to grasp an object and precisely stack it on another. Solving this difficult and practically relevant problem in the real world is an important long-term goal for the field of robotics. Here we take a step towards this goal by examining the problem in simulation and providing models and techniques aimed at solving it. We introduce two extensions to the Deep Deterministic Policy Gradient algorithm (DDPG), a model-free Q-learning based method, which make it significantly more data-efficient and scalable. Our results show that by making extensive use…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Adversarial Robustness in Machine Learning

MethodsQ-Learning