Combining imagination and heuristics to learn strategies that generalize

Erik J Peterson; Necati Alp M\"uyesser; Timothy Verstynen; Kyle; Dunovan

arXiv:1809.03406·cs.AI·June 15, 2020

Combining imagination and heuristics to learn strategies that generalize

Erik J Peterson, Necati Alp M\"uyesser, Timothy Verstynen, Kyle, Dunovan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a hierarchical reinforcement learning model that combines heuristics and imagination, inspired by human prefrontal networks, to improve learning speed, transferability, and interpretability in complex environments.

Contribution

It presents a novel stumbler-strategist network that integrates heuristics and imagination, enhancing generalization and learning efficiency in reinforcement learning tasks.

Findings

01

Accelerates learning in Wythoff's game

02

Enhances transfer to new games

03

Improves model interpretability

Abstract

Deep reinforcement learning can match or exceed human performance in stable contexts, but with minor changes to the environment artificial networks, unlike humans, often cannot adapt. Humans rely on a combination of heuristics to simplify computational load and imagination to extend experiential learning to new and more challenging environments. Motivated by theories of the hierarchical organization of the human prefrontal networks, we have developed a model of hierarchical reinforcement learning that combines both heuristics and imagination into a stumbler-strategist network. We test performance of this network using Wythoff's game, a gridworld environment with a known optimal strategy. We show that a heuristic labeling of each position as hot or cold, combined with imagined play, both accelerates learning and promotes transfer to novel games, while also improving model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CoAxLab/azad
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics