CALE: Continuous Arcade Learning Environment

Jesse Farebrother; Pablo Samuel Castro

arXiv:2410.23810·cs.LG·November 1, 2024

CALE: Continuous Arcade Learning Environment

Jesse Farebrother, Pablo Samuel Castro

PDF

Open Access 1 Repo

TL;DR

The paper introduces CALE, an extension of the Arcade Learning Environment that supports continuous actions, enabling benchmarking of continuous-control and value-based agents on Atari 2600 games.

Contribution

CALE extends the ALE with continuous actions, allowing for unified evaluation of diverse reinforcement learning agents on Atari environments.

Findings

01

Initial baseline results with Soft Actor-Critic.

02

Open research questions and directions provided.

03

Supports benchmarking of continuous and discrete agents.

Abstract

We introduce the Continuous Arcade Learning Environment (CALE), an extension of the well-known Arcade Learning Environment (ALE) [Bellemare et al., 2013]. The CALE uses the same underlying emulator of the Atari 2600 gaming system (Stella), but adds support for continuous actions. This enables the benchmarking and evaluation of continuous-control agents (such as PPO [Schulman et al., 2017] and SAC [Haarnoja et al., 2018]) and value-based agents (such as DQN [Mnih et al., 2015] and Rainbow [Hessel et al., 2018]) on the same environment suite. We provide a series of open questions and research directions that CALE enables, as well as initial baseline results using Soft Actor-Critic. CALE is available as part of the ALE athttps://github.com/Farama-Foundation/Arcade-Learning-Environment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

farama-foundation/arcade-learning-environment
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInnovative Teaching and Learning Methods · Online Learning and Analytics · Intelligent Tutoring Systems and Adaptive Learning

MethodsDense Connections · Q-Learning · Deep Q-Network · Dilated Convolution · Convolution · Entropy Regularization · Average Pooling · Global Average Pooling · Proximal Policy Optimization · 1x1 Convolution