Learning Representations for Pixel-based Control: What Matters and Why?

Manan Tomar; Utkarsh A. Mishra; Amy Zhang; Matthew E. Taylor

arXiv:2111.07775 · cs.LG · November 16, 2021 · 1 citation

Open Access

TL;DR

This paper studies representation learning for pixel-based control in environments with background distractors. It proposes a simple baseline and analyzes where existing methods fall short in this harder setting.

Contribution

The paper introduces a simple baseline for pixel-based control in settings with background distractors. It analyzes when and why existing methods fail, and argues that benchmark characteristics must be taken into account when evaluating them.

Findings

01. The baseline learns meaningful representations without metric-based learning, data augmentations, world-model learning, or contrastive learning.

02. Existing methods often fail or perform similarly to the baseline in distractor-rich environments.

03. Benchmark evaluations should account for environment characteristics such as reward density and distractors.

Abstract

Learning representations for pixel-based control has garnered significant attention recently in reinforcement learning. A wide range of methods have been proposed to enable efficient learning, leading to sample complexities similar to those in the full state setting. However, moving beyond carefully curated pixel data sets (centered crop, appropriate lighting, clear background, etc.) remains challenging. In this paper, we adopt a more difficult setting, incorporating background distractors, as a first step towards addressing this challenge. We present a simple baseline approach that can learn meaningful representations with no metric-based learning, no data augmentations, no world-model learning, and no contrastive learning. We then analyze when and why previously proposed methods are likely to fail or reduce to the same performance as the baseline in this harder setting and why we…

Taxonomy

Topics: Reinforcement Learning in Robotics · Single-cell and spatial transcriptomics · Model Reduction and Neural Networks