A Complementary Learning Systems Approach to Temporal Difference   Learning

Sam Blakeman; Denis Mareschal

arXiv:1905.02636·q-bio.NC·May 8, 2019·1 cites

A Complementary Learning Systems Approach to Temporal Difference Learning

Sam Blakeman, Denis Mareschal

PDF

Open Access

TL;DR

This paper introduces a novel reinforcement learning algorithm inspired by brain systems, combining neural networks and self-organizing maps to improve data efficiency and flexibility in decision-making tasks.

Contribution

It proposes Complementary Temporal Difference Learning (CTDL), integrating a DNN with a SOM to mimic brain's complementary learning systems for RL.

Findings

01

CTDL outperforms classic DQN in grid world and Cart-Pole tasks.

02

TD error effectively mediates between the two systems.

03

Approach demonstrates biological plausibility.

Abstract

Complementary Learning Systems (CLS) theory suggests that the brain uses a 'neocortical' and a 'hippocampal' learning system to achieve complex behavior. These two systems are complementary in that the 'neocortical' system relies on slow learning of distributed representations while the 'hippocampal' system relies on fast learning of pattern-separated representations. Both of these systems project to the striatum, which is a key neural structure in the brain's implementation of Reinforcement Learning (RL). Current deep RL approaches share similarities with a 'neocortical' system because they slowly learn distributed representations through backpropagation in Deep Neural Networks (DNNs). An ongoing criticism of such approaches is that they are data inefficient and lack flexibility. CLS theory suggests that the addition of a 'hippocampal' system could address these criticisms. In the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMemory and Neural Mechanisms · Neural dynamics and brain function · EEG and Brain-Computer Interfaces

MethodsSelf-Organizing Map