Learning Dynamic Attribute-factored World Models for Efficient Multi-object Reinforcement Learning
Fan Feng, Sara Magliacane

TL;DR
This paper introduces DAFT-RL, a framework that leverages object-centric representations and attribute-based graphs to improve generalization in multi-object reinforcement learning tasks, especially for unseen objects and task compositions.
Contribution
The paper proposes a novel attribute-factored world model that captures object dynamics and interactions at the attribute level, enhancing generalization in reinforcement learning.
Findings
Outperforms state-of-the-art in unseen object generalization
Effective in compositional task transfer
Handles varying object attributes and interactions
Abstract
In many reinforcement learning tasks, the agent has to learn to interact with many objects of different types and generalize to unseen combinations and numbers of objects. Often a task is a composition of previously learned tasks (e.g. block stacking). These are examples of compositional generalization, in which we compose object-centric representations to solve complex tasks. Recent works have shown the benefits of object-factored representations and hierarchical abstractions for improving sample efficiency in these settings. On the other hand, these methods do not fully exploit the benefits of factorization in terms of object attributes. In this paper, we address this opportunity and introduce the Dynamic Attribute FacTored RL (DAFT-RL) framework. In DAFT-RL, we leverage object-centric representation learning to extract objects from visual inputs. We learn to classify them in classes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics
