Learning Synergies between Pushing and Grasping with Self-supervised   Deep Reinforcement Learning

Andy Zeng; Shuran Song; Stefan Welker; Johnny Lee; Alberto Rodriguez,; Thomas Funkhouser

arXiv:1803.09956·cs.RO·October 2, 2018

Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning

Andy Zeng, Shuran Song, Stefan Welker, Johnny Lee, Alberto Rodriguez,, Thomas Funkhouser

PDF

4 Repos

TL;DR

This paper presents a self-supervised deep reinforcement learning approach for robotic manipulation that learns to synergize pushing and grasping actions, improving efficiency and success rates in cluttered environments.

Contribution

It introduces a joint learning framework for pushing and grasping using two convolutional networks trained via Q-learning, enabling robots to discover complex manipulation strategies from scratch.

Findings

01

Learned pushing actions facilitate better grasping in cluttered scenes.

02

Achieved higher grasp success rates compared to baseline methods.

03

System generalizes effectively to novel objects.

Abstract

Skilled robotic manipulation benefits from complex synergies between non-prehensile (e.g. pushing) and prehensile (e.g. grasping) actions: pushing can help rearrange cluttered objects to make space for arms and fingers; likewise, grasping can help displace objects to make pushing movements more precise and collision-free. In this work, we demonstrate that it is possible to discover and learn these synergies from scratch through model-free deep reinforcement learning. Our method involves training two fully convolutional networks that map from visual observations to actions: one infers the utility of pushes for a dense pixel-wise sampling of end effector orientations and locations, while the other does the same for grasping. Both networks are trained jointly in a Q-learning framework and are entirely self-supervised by trial and error, where rewards are provided from successful grasps. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsQ-Learning