COBRA: Data-Efficient Model-Based RL through Unsupervised Object   Discovery and Curiosity-Driven Exploration

Nicholas Watters; Loic Matthey; Matko Bosnjak; Christopher P. Burgess,; Alexander Lerchner

arXiv:1905.09275·cs.LG·August 15, 2019·67 cites

COBRA: Data-Efficient Model-Based RL through Unsupervised Object Discovery and Curiosity-Driven Exploration

Nicholas Watters, Loic Matthey, Matko Bosnjak, Christopher P. Burgess,, Alexander Lerchner

PDF

Open Access 2 Repos

TL;DR

COBRA is a modular, data-efficient model-based reinforcement learning approach that leverages unsupervised object discovery and curiosity-driven exploration to improve robustness and sample efficiency in continuous control tasks.

Contribution

It introduces a novel unsupervised, object-based modeling framework combined with intrinsic motivation for exploration, enabling rapid learning of diverse tasks without supervision.

Findings

01

Achieves high data efficiency in continuous control environments.

02

Demonstrates robustness to task-irrelevant perturbations.

03

Performs well on structured hold-out tests of policy robustness.

Abstract

Data efficiency and robustness to task-irrelevant perturbations are long-standing challenges for deep reinforcement learning algorithms. Here we introduce a modular approach to addressing these challenges in a continuous control environment, without using hand-crafted or supervised information. Our Curious Object-Based seaRch Agent (COBRA) uses task-free intrinsically motivated exploration and unsupervised learning to build object-based models of its environment and action space. Subsequently, it can learn a variety of tasks through model-based search in very few steps and excel on structured hold-out tests of policy robustness.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Adversarial Robustness in Machine Learning