A Complementary Learning Systems Approach to Temporal Difference Learning
Sam Blakeman, Denis Mareschal

TL;DR
This paper introduces a novel reinforcement learning algorithm inspired by brain systems, combining neural networks and self-organizing maps to improve data efficiency and flexibility in decision-making tasks.
Contribution
It proposes Complementary Temporal Difference Learning (CTDL), integrating a DNN with a SOM to mimic brain's complementary learning systems for RL.
Findings
CTDL outperforms classic DQN in grid world and Cart-Pole tasks.
TD error effectively mediates between the two systems.
Approach demonstrates biological plausibility.
Abstract
Complementary Learning Systems (CLS) theory suggests that the brain uses a 'neocortical' and a 'hippocampal' learning system to achieve complex behavior. These two systems are complementary in that the 'neocortical' system relies on slow learning of distributed representations while the 'hippocampal' system relies on fast learning of pattern-separated representations. Both of these systems project to the striatum, which is a key neural structure in the brain's implementation of Reinforcement Learning (RL). Current deep RL approaches share similarities with a 'neocortical' system because they slowly learn distributed representations through backpropagation in Deep Neural Networks (DNNs). An ongoing criticism of such approaches is that they are data inefficient and lack flexibility. CLS theory suggests that the addition of a 'hippocampal' system could address these criticisms. In the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMemory and Neural Mechanisms · Neural dynamics and brain function · EEG and Brain-Computer Interfaces
MethodsSelf-Organizing Map
