DMC-VB: A Benchmark for Representation Learning for Control with Visual   Distractors

Joseph Ortiz; Antoine Dedieu; Wolfgang Lehrach; Swaroop Guntupalli,; Carter Wendelken; Ahmad Humayun; Guangyao Zhou; Sivaramakrishnan Swaminathan,; Miguel L\'azaro-Gredilla; Kevin Murphy

arXiv:2409.18330·cs.LG·September 30, 2024

DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors

Joseph Ortiz, Antoine Dedieu, Wolfgang Lehrach, Swaroop Guntupalli,, Carter Wendelken, Ahmad Humayun, Guangyao Zhou, Sivaramakrishnan Swaminathan,, Miguel L\'azaro-Gredilla, Kevin Murphy

PDF

Open Access 1 Repo 1 Video

TL;DR

The paper introduces DMC-VB, a comprehensive benchmark dataset for evaluating the robustness of offline reinforcement learning agents in continuous control tasks with visual distractors, highlighting challenges in representation learning from visual inputs.

Contribution

The paper presents a new large-scale dataset and benchmark for assessing representation learning methods in control tasks with visual distractors, including diverse tasks and systematic evaluation protocols.

Findings

01

Pretrained representations do not improve policy learning on DMC-VB.

02

A significant gap exists between policies learned on pixel observations and on true states.

03

Policy learning benefits from pretrained representations when expert data is limited, especially with suboptimal or stochastic hidden goal data.

Abstract

Learning from previously collected data via behavioral cloning or offline reinforcement learning (RL) is a powerful recipe for scaling generalist agents by avoiding the need for expensive online learning. Despite strong generalization in some respects, agents are often remarkably brittle to minor visual variations in control-irrelevant factors such as the background or camera viewpoint. In this paper, we present theDeepMind Control Visual Benchmark (DMC-VB), a dataset collected in the DeepMind Control Suite to evaluate the robustness of offline RL agents for solving continuous control tasks from visual input in the presence of visual distractors. In contrast to prior works, our dataset (a) combines locomotion and navigation tasks of varying difficulties, (b) includes static and dynamic visual variations, (c) considers data generated by policies with different skill levels, (d)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google-deepmind/dmc_vision_benchmark
jaxOfficial

Videos

DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors· slideslive

Taxonomy

TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)