Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
Anthony GX-Chen, Kenneth Marino, Rob Fergus

TL;DR
This paper introduces an object-centric hierarchical world model for reinforcement learning that improves exploration efficiency, transferability, and planning over long horizons by abstracting items and attributes at different levels.
Contribution
It proposes a fully model-based, discriminative world model utilizing hierarchical abstraction for improved exploration and transfer in RL, with demonstrated success in diverse environments.
Findings
Outperforms state-of-the-art low-level methods in 2D crafting and MiniHack environments.
Enables zero-shot and few-shot transfer across item types and environments.
Effectively plans over long horizons using the learned abstract states.
Abstract
In the face of difficult exploration problems in reinforcement learning, we study whether giving an agent an object-centric mapping (describing a set of items and their attributes) allow for more efficient learning. We found this problem is best solved hierarchically by modelling items at a higher level of state abstraction to pixels, and attribute change at a higher level of temporal abstraction to primitive actions. This abstraction simplifies the transition dynamic by making specific future states easier to predict. We make use of this to propose a fully model-based algorithm that learns a discriminative world model, plans to explore efficiently with only a count-based intrinsic reward, and can subsequently plan to reach any discovered (abstract) states. We demonstrate the model's ability to (i) efficiently solve single tasks, (ii) transfer zero-shot and few-shot across item types…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Data Processing Techniques · Time Series Analysis and Forecasting · Advanced Computational Techniques and Applications
MethodsSparse Evolutionary Training
