Learning model-based strategies in simple environments with hierarchical   q-networks

Necati Alp Muyesser; Kyle Dunovan; Timothy Verstynen

arXiv:1801.06689·cs.AI·January 23, 2018·1 cites

Learning model-based strategies in simple environments with hierarchical q-networks

Necati Alp Muyesser, Kyle Dunovan, Timothy Verstynen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a Hierarchical Q-Network inspired by human brain organization that learns generalizable strategies in simple environments, outperforming traditional RL methods and enabling transparent inspection of learned rules.

Contribution

The novel Hierarchical Q-Network (HQN) demonstrates improved strategy generalization and interpretability in simple environments, inspired by biological hierarchical structures.

Findings

01

HQN learned heuristic strategies that generalized across game variations.

02

Traditional RL approaches failed to perform well on Wythoff's game variants.

03

HQN's internal model of the game was transparent after training.

Abstract

Recent advances in deep learning have allowed artificial agents to rival human-level performance on a wide range of complex tasks; however, the ability of these networks to learn generalizable strategies remains a pressing challenge. This critical limitation is due in part to two factors: the opaque information representation in deep neural networks and the complexity of the task environments in which they are typically deployed. Here we propose a novel Hierarchical Q-Network (HQN) motivated by theories of the hierarchical organization of the human prefrontal cortex, that attempts to identify lower dimensional patterns in the value landscape that can be exploited to construct an internal model of rules in simple environments. We draw on combinatorial games, where there exists a single optimal strategy for winning that generalizes across other features of the game, to probe the strategy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CoAxLab/azad
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural dynamics and brain function · Reinforcement Learning in Robotics · Neural and Behavioral Psychology Studies