Entity Abstraction in Visual Model-Based Reinforcement Learning
Rishi Veerapaneni, John D. Co-Reyes, Michael Chang, Michael Janner,, Chelsea Finn, Jiajun Wu, Joshua B. Tenenbaum, Sergey Levine

TL;DR
This paper introduces OP3, a probabilistic entity-centric framework for model-based reinforcement learning that learns object representations from raw visuals and generalizes well to new configurations, outperforming existing models.
Contribution
The paper presents the first fully probabilistic, entity-centric dynamic latent variable framework for visual model-based RL that learns from raw observations without supervision.
Findings
OP3 generalizes to unseen object configurations and quantities.
OP3 outperforms models with object supervision.
OP3 achieves 2-3x better accuracy than non-entity models.
Abstract
This paper tests the hypothesis that modeling a scene in terms of entities and their local interactions, as opposed to modeling the scene globally, provides a significant benefit in generalizing to physical tasks in a combinatorial space the learner has not encountered before. We present object-centric perception, prediction, and planning (OP3), which to the best of our knowledge is the first fully probabilistic entity-centric dynamic latent variable framework for model-based reinforcement learning that acquires entity representations from raw visual observations without supervision and uses them to predict and plan. OP3 enforces entity-abstraction -- symmetric processing of each entity representation with the same locally-scoped function -- which enables it to scale to model different numbers and configurations of objects from those in training. Our approach to solving the key…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
