Investigating Pre-Training Objectives for Generalization in Vision-Based   Reinforcement Learning

Donghu Kim; Hojoon Lee; Kyungmin Lee; Dongyoon Hwang; Jaegul Choo

arXiv:2406.06037·cs.LG·June 11, 2024

Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learning

Donghu Kim, Hojoon Lee, Kyungmin Lee, Dongyoon Hwang, Jaegul Choo

PDF

Open Access 1 Repo

TL;DR

This paper introduces the Atari Pre-training Benchmark to evaluate how different pre-training objectives affect the generalization of vision-based reinforcement learning across diverse environments, highlighting the importance of task-agnostic features.

Contribution

The paper presents a new benchmark and comprehensive analysis of pre-training objectives, revealing their impact on generalization in vision-based RL.

Findings

01

Task-agnostic pre-training improves cross-environment generalization.

02

Task-specific pre-training enhances performance in similar environments.

03

Pre-training on diverse environments benefits overall robustness.

Abstract

Recently, various pre-training methods have been introduced in vision-based Reinforcement Learning (RL). However, their generalization ability remains unclear due to evaluations being limited to in-distribution environments and non-unified experimental setups. To address this, we introduce the Atari Pre-training Benchmark (Atari-PB), which pre-trains a ResNet-50 model on 10 million transitions from 50 Atari games and evaluates it across diverse environment distributions. Our experiments show that pre-training objectives focused on learning task-agnostic features (e.g., identifying objects and understanding temporal dynamics) enhance generalization across different environments. In contrast, objectives focused on learning task-specific knowledge (e.g., identifying agents and fitting reward functions) improve performance in environments similar to the pre-training dataset but not in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dojeon-ai/atari-pb
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics