Visual Backtracking Teleoperation: A Data Collection Protocol for   Offline Image-Based Reinforcement Learning

David Brandfonbrener; Stephen Tu; Avi Singh; Stefan Welker; Chad; Boodoo; Nikolai Matni; Jake Varley

arXiv:2210.02343·cs.RO·October 6, 2022

Visual Backtracking Teleoperation: A Data Collection Protocol for Offline Image-Based Reinforcement Learning

David Brandfonbrener, Stephen Tu, Avi Singh, Stefan Welker, Chad, Boodoo, Nikolai Matni, Jake Varley

PDF

Open Access

TL;DR

This paper introduces Visual Backtracking Teleoperation (VBT), a novel data collection protocol that enhances offline image-based reinforcement learning for robotic tasks by including failures and recoveries, leading to improved value functions and policies.

Contribution

The paper proposes VBT, a new data collection method that captures diverse robot behaviors, including failures and recoveries, to improve offline RL performance from small datasets.

Findings

01

VBT data improves value function accuracy.

02

Offline RL on VBT data outperforms behavior cloning by 13%.

03

VBT enhances policy robustness in deformable manipulation tasks.

Abstract

We consider how to most efficiently leverage teleoperator time to collect data for learning robust image-based value functions and policies for sparse reward robotic tasks. To accomplish this goal, we modify the process of data collection to include more than just successful demonstrations of the desired task. Instead we develop a novel protocol that we call Visual Backtracking Teleoperation (VBT), which deliberately collects a dataset of visually similar failures, recoveries, and successes. VBT data collection is particularly useful for efficiently learning accurate value functions from small datasets of image-based observations. We demonstrate VBT on a real robot to perform continuous control from image observations for the deformable manipulation task of T-shirt grasping. We find that by adjusting the data collection process we improve the quality of both the learned value functions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Neuroscience and Neural Engineering