Entity Abstraction in Visual Model-Based Reinforcement Learning

Rishi Veerapaneni; John D. Co-Reyes; Michael Chang; Michael Janner; Chelsea Finn; Jiajun Wu; Joshua B. Tenenbaum; Sergey Levine

arXiv:1910.12827 · cs.LG · May 7, 2020 · 32 cites

TL;DR

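This paper introduces OP3, a probabilistic entity-centric framework for model-based reinforcement learning that learns object representations from raw visual observations without supervision and generalizes to object configurations and counts not seen in training, outperforming both an object-supervised baseline and models without entity abstraction.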

Contribution

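The paper presents what the authors describe, to the best of their knowledge, as the first fully probabilistic, entity-centric dynamic latent variable framework for visual model-based RL that learns entity representations from raw observations without supervision and uses them to predict and plan.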

Findings

01. OP3 generalizes to object configurations and numbers of objects not seen during training.

02. OP3 outperforms a baseline that has access to object supervision.

03. OP3 achieves 2-3x better accuracy than models without entity abstraction.

Abstract

This paper tests the hypothesis that modeling a scene in terms of entities and their local interactions, as opposed to modeling the scene globally, provides a significant benefit in generalizing to physical tasks in a combinatorial space the learner has not encountered before. We present object-centric perception, prediction, and planning (OP3), which to the best of our knowledge is the first fully probabilistic entity-centric dynamic latent variable framework for model-based reinforcement learning that acquires entity representations from raw visual observations without supervision and uses them to predict and plan. OP3 enforces entity-abstraction -- symmetric processing of each entity representation with the same locally-scoped function -- which enables it to scale to model different numbers and configurations of objects from those in training. Our approach to solving the key…
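The entity-abstraction idea in the abstract (the same locally-scoped function applied symmetrically to each entity) can be illustrated with a toy sketch. This is not the OP3 implementation: the names `pairwise_effect`, `step_entity`, `step_scene` and the constants `W_SELF` and `W_PAIR` are hypothetical, and the fixed arithmetic update is only an assumed stand-in for a learned model. It shows why sharing one function across entities lets the same code handle any number of entities in any order.

```python
import math
from typing import List, Sequence

Vector = List[float]

# Hypothetical fixed weights, standing in for learned parameters.
W_SELF = 0.9
W_PAIR = 0.1


def pairwise_effect(focus: Sequence[float], other: Sequence[float]) -> Vector:
    """Effect of `other` on `focus`; it depends only on this one pair."""
    return [math.tanh(o - f) for f, o in zip(focus, other)]


def step_entity(focus: Sequence[float], context: Sequence[Sequence[float]]) -> Vector:
    """Update one entity from its own state plus the summed pairwise effects."""
    total = [0.0] * len(focus)
    for other in context:
        effect = pairwise_effect(focus, other)
        total = [t + e for t, e in zip(total, effect)]
    return [W_SELF * f + W_PAIR * t for f, t in zip(focus, total)]


def step_scene(entities: Sequence[Sequence[float]]) -> List[Vector]:
    """Apply the same update function to every entity in the scene."""
    return [
        step_entity(e, [o for j, o in enumerate(entities) if j != i])
        for i, e in enumerate(entities)
    ]


if __name__ == "__main__":
    scene = [[0.0, 1.0], [2.0, -1.0], [0.5, 0.5]]
    print(step_scene(scene))
    # The same function accepts a larger scene without any change.
    print(len(step_scene(scene + [[1.0, 1.0], [3.0, 0.0]])))
```

Because no parameter is tied to a particular entity slot, reordering the input entities only reorders the outputs, and adding entities requires no new parameters.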

Code & Models

Repositories

jcoreyes/OP3 (PyTorch, official)

Taxonomy

Topics: Reinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning