Human-Level Reinforcement Learning through Theory-Based Modeling,   Exploration, and Planning

Pedro A. Tsividis; Joao Loula; Jake Burga; Nathan Foss; Andres; Campero; Thomas Pouncy; Samuel J. Gershman; Joshua B. Tenenbaum

arXiv:2107.12544·cs.AI·July 28, 2021·22 cites

Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning

Pedro A. Tsividis, Joao Loula, Jake Burga, Nathan Foss, Andres, Campero, Thomas Pouncy, Samuel J. Gershman, Joshua B. Tenenbaum

PDF

Open Access

TL;DR

This paper introduces Theory-Based Reinforcement Learning, a model-based approach using human-like causal theories to enable rapid, human-like learning and generalization in complex environments, exemplified by a video game agent.

Contribution

It presents a novel RL framework that incorporates intuitive theories for efficient learning, demonstrated through a versatile agent that learns multiple games quickly and generalizes well.

Findings

01

EMPA learns new games in minutes

02

EMPA generalizes to new levels and situations

03

Model captures human-like exploration and learning dynamics

Abstract

Reinforcement learning (RL) studies how an agent comes to achieve reward in an environment through interactions over time. Recent advances in machine RL have surpassed human expertise at the world's oldest board games and many classic video games, but they require vast quantities of experience to learn successfully -- none of today's algorithms account for the human ability to learn so many different tasks, so quickly. Here we propose a new approach to this challenge based on a particularly strong form of model-based RL which we call Theory-Based Reinforcement Learning, because it uses human-like intuitive theories -- rich, abstract, causal models of physical objects, intentional agents, and their interactions -- to explore and model an environment, and plan effectively to achieve task goals. We instantiate the approach in a video game playing agent called EMPA (the Exploring, Modeling,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Explainable Artificial Intelligence (XAI)