Bigger, Better, Faster: Human-level Atari with human-level efficiency

Max Schwarzer; Johan Obando-Ceron; Aaron Courville; Marc Bellemare; Rishabh Agarwal; Pablo Samuel Castro

arXiv:2305.19452 · cs.LG · November 14, 2023 · 5 cites

TL;DR

This paper presents BBF, a value-based reinforcement learning agent that achieves super-human performance on the Atari 100K benchmark. It does so by scaling up the neural networks used for value estimation and by adopting design choices that keep this scaling sample-efficient.

Contribution

Introduces BBF, a sample-efficient value-based RL agent that exceeds human-level performance on the Atari 100K benchmark, together with an extensive analysis of the design choices behind it.

Findings

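01 BBF achieves super-human performance on the Atari 100K benchmark.

02 Scaling the neural networks used for value estimation improves performance, provided the scaling is done in a sample-efficient manner.

03 The accompanying design choices are what enable this scaling, and the paper analyzes them extensively.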

Abstract

We introduce a value-based RL agent, which we call BBF, that achieves super-human performance in the Atari 100K benchmark. BBF relies on scaling the neural networks used for value estimation, as well as a number of other design choices that enable this scaling in a sample-efficient manner. We conduct extensive analyses of these design choices and provide insights for future work. We end with a discussion about updating the goalposts for sample-efficient RL research on the ALE. We make our code and data publicly available at https://github.com/google-research/google-research/tree/master/bigger_better_faster.
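Claims of "super-human" performance on Atari are conventionally stated in terms of a human-normalized score, which rescales a raw game score so that a random policy maps to 0 and the human reference maps to 1. The abstract does not spell out this formula, so the following is a minimal sketch of the standard convention, not the authors' evaluation code; the function name and the example numbers are hypothetical.

```python
def human_normalized_score(agent_score: float,
                           random_score: float,
                           human_score: float) -> float:
    """Rescale a raw score: 0.0 is random-policy level, 1.0 is the human reference."""
    if human_score == random_score:
        raise ValueError("human and random reference scores must differ")
    return (agent_score - random_score) / (human_score - random_score)


# Hypothetical raw scores for a single game, chosen only for illustration.
score = human_normalized_score(agent_score=150.0,
                               random_score=50.0,
                               human_score=100.0)
print(score)        # 2.0
print(score > 1.0)  # True: above the human reference on this game
```

A value above 1.0 on a game means the agent outscored the human reference there; benchmark-level claims aggregate such per-game values across all games.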

Taxonomy

Topics: Human Pose and Action Recognition · Anomaly Detection Techniques and Applications · Reinforcement Learning in Robotics