Learning Visual Affordances with Target-Orientated Deep Q-Network to   Grasp Objects by Harnessing Environmental Fixtures

Hengyue Liang; Xibai Lou; Yang Yang; Changhyun Choi

arXiv:1910.03781·cs.RO·April 6, 2021·1 cites

Learning Visual Affordances with Target-Orientated Deep Q-Network to Grasp Objects by Harnessing Environmental Fixtures

Hengyue Liang, Xibai Lou, Yang Yang, Changhyun Choi

PDF

Open Access

TL;DR

This paper presents a self-supervised deep reinforcement learning approach for robotic grasping that leverages environmental fixtures, using a novel Target-Oriented Deep Q-Network to learn visual affordances for complex object grasping tasks.

Contribution

It introduces a new visual affordance learning method with TO-DQN, enabling robots to grasp objects using environmental fixtures without prior knowledge.

Findings

01

TO-DQN outperforms standard DQN in training efficiency and robustness.

02

The learned policy achieves human-comparable performance in simulation and real-world tests.

03

The approach effectively generalizes across different environment settings.

Abstract

This paper introduces a challenging object grasping task and proposes a self-supervised learning approach. The goal of the task is to grasp an object which is not feasible with a single parallel gripper, but only with harnessing environment fixtures (e.g., walls, furniture, heavy objects). This Slide-to-Wall grasping task assumes no prior knowledge except the partial observation of a target object. Hence the robot should learn an effective policy given a scene observation that may include the target object, environmental fixtures, and any other disturbing objects. We formulate the problem as visual affordances learning for which Target-Oriented Deep Q-Network (TO-DQN) is proposed to efficiently learn visual affordance maps (i.e., Q-maps) to guide robot actions. Since the training necessitates robot's exploration and collision with the fixtures, TO-DQN is first trained safely with a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Robotics and Sensor-Based Localization · Domain Adaptation and Few-Shot Learning

MethodsQ-Learning · Dense Connections · Convolution · Deep Q-Network