VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning

Che Wang; Xufang Luo; Keith Ross; Dongsheng Li

arXiv:2202.10324·cs.CV·April 3, 2023·5 cites

VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning

Che Wang, Xufang Luo, Keith Ross, Dongsheng Li

PDF

Open Access 1 Repo 1 Video

TL;DR

VRL3 introduces a three-stage data-driven framework for visual deep reinforcement learning, significantly improving sample efficiency and solving complex tasks with less data and computation.

Contribution

The paper presents VRL3, a novel three-stage framework that combines visual representation learning, offline RL, and online fine-tuning for challenging visual DRL tasks.

Findings

01

Achieves 780% better sample efficiency on hand manipulation tasks.

02

Solves the hardest task with only 10% of the usual computation.

03

Demonstrates the effectiveness of data-driven approaches in complex visual DRL environments.

Abstract

We propose VRL3, a powerful data-driven framework with a simple design for solving challenging visual deep reinforcement learning (DRL) tasks. We analyze a number of major obstacles in taking a data-driven approach, and present a suite of design principles, novel findings, and critical insights about data-driven visual DRL. Our framework has three stages: in stage 1, we leverage non-RL datasets (e.g. ImageNet) to learn task-agnostic visual representations; in stage 2, we use offline RL data (e.g. a limited number of expert demonstrations) to convert the task-agnostic representations into more powerful task-specific representations; in stage 3, we fine-tune the agent with online RL. On a set of challenging hand manipulation tasks with sparse reward and realistic visual inputs, compared to the previous SOTA, VRL3 achieves an average of 780% better sample efficiency. And on the hardest…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facebookresearch/drqv2
pytorchOfficial

Videos

VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning· slideslive

Taxonomy

TopicsTactile and Sensory Interactions