OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments
Quentin Delfosse, Jannis Bl\"uml, Bjarne Gregori, Sebastian, Sztwiertnia, Kristian Kersting

TL;DR
OCAtari introduces an environment extension for Atari games that enables resource-efficient object-centric state extraction, facilitating object discovery, representation learning, and reinforcement learning, thus advancing research in perceptually grounded AI.
Contribution
The paper presents OCAtari, a novel extension to Atari environments that supports object-centric perception and learning, addressing limitations of pixel-based approaches.
Findings
OCAtari effectively detects objects in Atari games.
The framework is resource-efficient for object extraction.
OCAtari enables object-centric reinforcement learning.
Abstract
Cognitive science and psychology suggest that object-centric representations of complex scenes are a promising step towards enabling efficient abstract reasoning from low-level perceptual features. Yet, most deep reinforcement learning approaches only rely on pixel-based representations that do not capture the compositional properties of natural scenes. For this, we need environments and datasets that allow us to work and evaluate object-centric approaches. In our work, we extend the Atari Learning Environments, the most-used evaluation framework for deep RL approaches, by introducing OCAtari, that performs resource-efficient extractions of the object-centric states for these games. Our framework allows for object discovery, object representation learning, as well as object-centric RL. We evaluate OCAtari's detection capabilities and resource efficiency. Our source code is available at…
Peer Reviews
Decision·Submitted to ICLR 2024
This work provides a concrete benchmark/dataset to train and evaluate object centric properties of RL approaches. Key strengths of the paper include: * The benchmark can serve a nice tool to test the abstraction ability of our methods today as humans seamlessly can detect objects and reason in the space of object oriented. * It can inform the quality of representations learned in ALE domains and further if the methods have abilities such as compositional generalization, etc. * The work
The paper offers interesting contribution, however I believe the paper is not up to the mark for the ICLR conference venue. I find the following issues major limiting factor in recommending acceptance: * The topic and contribution is relevant, however it is unclear to me immediately what this buys us for methods not focused at object central. For e.g. a method might be able to achieve very good performance but not do well on object centric evaluation. * What is missing and would be nice to
The library can serve as a best-case result for comparing against approaches that do Atari object detection without access to the RAM state. It also allows research to proceed on designing agents that exploit object-centric representations without first waiting for object-detection methods. The paper also argues that when visual features are not needed, this library can dramatically speed up the training process by eliminating much of the rendering pipeline.
1. I am somewhat skeptical that this library will be broadly useful to the field. The paper presents a publication-count-based argument that there is a need for object-centric Atari environments. I don't draw the same conclusion from the presented evidence. The authors may be surprised to learn that one of the earliest Atari + reinforcement learning papers (from 2008!) dealt with this exact question: Diuk, Cohen & Littman's paper on Object-Oriented MDPs. The fact that Atari did not become a popu
1. Creating the object-centric Atari Learning Environment greatly impacts the community that develops the object-centric RL algorithms. 2. The source code is available.
1. Although the introduction is well-written, it is still unclear why the ALE was selected. Please see my first question below.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
MethodsBalanced Selection
