Efficient Exploration and Discriminative World Model Learning with an   Object-Centric Abstraction

Anthony GX-Chen; Kenneth Marino; Rob Fergus

arXiv:2408.11816·cs.LG·April 15, 2025

Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction

Anthony GX-Chen, Kenneth Marino, Rob Fergus

PDF

Open Access 1 Video

TL;DR

This paper introduces an object-centric hierarchical world model for reinforcement learning that improves exploration efficiency, transferability, and planning over long horizons by abstracting items and attributes at different levels.

Contribution

It proposes a fully model-based, discriminative world model utilizing hierarchical abstraction for improved exploration and transfer in RL, with demonstrated success in diverse environments.

Findings

01

Outperforms state-of-the-art low-level methods in 2D crafting and MiniHack environments.

02

Enables zero-shot and few-shot transfer across item types and environments.

03

Effectively plans over long horizons using the learned abstract states.

Abstract

In the face of difficult exploration problems in reinforcement learning, we study whether giving an agent an object-centric mapping (describing a set of items and their attributes) allow for more efficient learning. We found this problem is best solved hierarchically by modelling items at a higher level of state abstraction to pixels, and attribute change at a higher level of temporal abstraction to primitive actions. This abstraction simplifies the transition dynamic by making specific future states easier to predict. We make use of this to propose a fully model-based algorithm that learns a discriminative world model, plans to explore efficiently with only a count-based intrinsic reward, and can subsequently plan to reach any discovered (abstract) states. We demonstrate the model's ability to (i) efficiently solve single tasks, (ii) transfer zero-shot and few-shot across item types…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction· slideslive

Taxonomy

TopicsAdvanced Data Processing Techniques · Time Series Analysis and Forecasting · Advanced Computational Techniques and Applications

MethodsSparse Evolutionary Training