OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments

Quentin Delfosse; Jannis Bl\"uml; Bjarne Gregori; Sebastian; Sztwiertnia; Kristian Kersting

arXiv:2306.08649·cs.LG·February 28, 2024·1 cites

OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments

Quentin Delfosse, Jannis Bl\"uml, Bjarne Gregori, Sebastian, Sztwiertnia, Kristian Kersting

PDF

Open Access 1 Repo 3 Reviews

TL;DR

OCAtari introduces an environment extension for Atari games that enables resource-efficient object-centric state extraction, facilitating object discovery, representation learning, and reinforcement learning, thus advancing research in perceptually grounded AI.

Contribution

The paper presents OCAtari, a novel extension to Atari environments that supports object-centric perception and learning, addressing limitations of pixel-based approaches.

Findings

01

OCAtari effectively detects objects in Atari games.

02

The framework is resource-efficient for object extraction.

03

OCAtari enables object-centric reinforcement learning.

Abstract

Cognitive science and psychology suggest that object-centric representations of complex scenes are a promising step towards enabling efficient abstract reasoning from low-level perceptual features. Yet, most deep reinforcement learning approaches only rely on pixel-based representations that do not capture the compositional properties of natural scenes. For this, we need environments and datasets that allow us to work and evaluate object-centric approaches. In our work, we extend the Atari Learning Environments, the most-used evaluation framework for deep RL approaches, by introducing OCAtari, that performs resource-efficient extractions of the object-centric states for these games. Our framework allows for object discovery, object representation learning, as well as object-centric RL. We evaluate OCAtari's detection capabilities and resource efficiency. Our source code is available at…

Peer Reviews

Decision·Submitted to ICLR 2024

Reviewer 01Rating 3· reject, not good enoughConfidence 4

Strengths

This work provides a concrete benchmark/dataset to train and evaluate object centric properties of RL approaches. Key strengths of the paper include: * The benchmark can serve a nice tool to test the abstraction ability of our methods today as humans seamlessly can detect objects and reason in the space of object oriented. * It can inform the quality of representations learned in ALE domains and further if the methods have abilities such as compositional generalization, etc. * The work

Weaknesses

The paper offers interesting contribution, however I believe the paper is not up to the mark for the ICLR conference venue. I find the following issues major limiting factor in recommending acceptance: * The topic and contribution is relevant, however it is unclear to me immediately what this buys us for methods not focused at object central. For e.g. a method might be able to achieve very good performance but not do well on object centric evaluation. * What is missing and would be nice to

Reviewer 02Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

The library can serve as a best-case result for comparing against approaches that do Atari object detection without access to the RAM state. It also allows research to proceed on designing agents that exploit object-centric representations without first waiting for object-detection methods. The paper also argues that when visual features are not needed, this library can dramatically speed up the training process by eliminating much of the rendering pipeline.

Weaknesses

1. I am somewhat skeptical that this library will be broadly useful to the field. The paper presents a publication-count-based argument that there is a need for object-centric Atari environments. I don't draw the same conclusion from the presented evidence. The authors may be surprised to learn that one of the earliest Atari + reinforcement learning papers (from 2008!) dealt with this exact question: Diuk, Cohen & Littman's paper on Object-Oriented MDPs. The fact that Atari did not become a popu

Reviewer 03Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

1. Creating the object-centric Atari Learning Environment greatly impacts the community that develops the object-centric RL algorithms. 2. The source code is available.

Weaknesses

1. Although the introduction is well-written, it is still unclear why the ALE was selected. Please see my first question below.

Code & Models

Repositories

k4ntz/oc_atari
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning

MethodsBalanced Selection