Schema Networks: Zero-shot Transfer with a Generative Causal Model of   Intuitive Physics

Ken Kansky; Tom Silver; David A. M\'ely; Mohamed Eldawy; Miguel; L\'azaro-Gredilla; Xinghua Lou; Nimrod Dorfman; Szymon Sidor; Scott Phoenix,; Dileep George

arXiv:1706.04317·cs.AI·August 21, 2017·150 cites

Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics

Ken Kansky, Tom Silver, David A. M\'ely, Mohamed Eldawy, Miguel, L\'azaro-Gredilla, Xinghua Lou, Nimrod Dorfman, Szymon Sidor, Scott Phoenix,, Dileep George

PDF

Open Access 2 Repos

TL;DR

The paper introduces Schema Networks, an object-oriented generative physics model that learns environment dynamics from data, enabling efficient zero-shot transfer and robust generalization in reinforcement learning tasks.

Contribution

It presents a novel Schema Network architecture that models causal physics, improving transfer and generalization over existing deep learning methods.

Findings

01

Schema Networks outperform A3C and Progressive Networks in transfer tasks.

02

They demonstrate faster learning and better zero-shot generalization.

03

The approach effectively learns environment dynamics from limited data.

Abstract

The recent adaptation of deep neural network-based methods to reinforcement learning and planning domains has yielded remarkable progress on individual tasks. Nonetheless, progress on task-to-task transfer remains limited. In pursuit of efficient and robust generalization, we introduce the Schema Network, an object-oriented generative physics simulator capable of disentangling multiple causes of events and reasoning backward through causes to achieve goals. The richly structured architecture of the Schema Network can learn the dynamics of an environment directly from data. We compare Schema Networks with Asynchronous Advantage Actor-Critic and Progressive Networks on a suite of Breakout variations, reporting results on training efficiency and zero-shot generalization, consistently demonstrating faster, more robust learning and better transfer. We argue that generalizing from limited…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning