Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic   Manipulation via Discretisation

Stephen James; Kentaro Wada; Tristan Laidlow; Andrew J. Davison

arXiv:2106.12534·cs.RO·March 16, 2022

Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation

Stephen James, Kentaro Wada, Tristan Laidlow, Andrew J. Davison

PDF

Open Access 1 Repo

TL;DR

This paper introduces a coarse-to-fine Q-attention method that discretizes the scene for efficient reinforcement learning in robotic manipulation, achieving state-of-the-art results with minimal data and rapid training.

Contribution

It presents a novel discretization approach enabling discrete RL in robotics, improving stability and data efficiency over actor-critic methods.

Findings

01

Achieves state-of-the-art performance on RLBench tasks.

02

Trains real-world policies in minutes with few demonstrations.

03

Enables near-lossless discretization of translation space.

Abstract

We present a coarse-to-fine discretisation method that enables the use of discrete reinforcement learning approaches in place of unstable and data-inefficient actor-critic methods in continuous robotics domains. This approach builds on the recently released ARM algorithm, which replaces the continuous next-best pose agent with a discrete one, with coarse-to-fine Q-attention. Given a voxelised scene, coarse-to-fine Q-attention learns what part of the scene to 'zoom' into. When this 'zooming' behaviour is applied iteratively, it results in a near-lossless discretisation of the translation space, and allows the use of a discrete action, deep Q-learning method. We show that our new coarse-to-fine algorithm achieves state-of-the-art performance on several difficult sparsely rewarded RLBench vision-based robotics tasks, and can train real-world policies, tabula rasa, in a matter of minutes,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

stepjam/ARM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Reinforcement Learning in Robotics · Robotics and Sensor-Based Localization

MethodsQ-Learning