Autonomous Reinforcement of Behavioral Sequences in Neural Dynamics

Sohrob Kazerounian; Matthew Luciw; Mathis Richter; Yulia; Sandamirskaya

arXiv:1210.3569·cs.NE·May 15, 2013·2 cites

Autonomous Reinforcement of Behavioral Sequences in Neural Dynamics

Sohrob Kazerounian, Matthew Luciw, Mathis Richter, Yulia, Sandamirskaya

PDF

Open Access

TL;DR

This paper presents DN-SARSA(λ), a neural algorithm that learns behavioral sequences from delayed rewards, integrating neural models, reinforcement learning, and working memory, validated on robots to demonstrate neural-compatible learning.

Contribution

It introduces DN-SARSA(λ), a novel neural reinforcement learning algorithm combining neural dynamics with sequence learning, validated on robotic platforms.

Findings

01

DN-SARSA(λ) performs comparably to discrete SARSA(λ) in learning sequences.

02

The algorithm successfully learns from exploration in both simulated and real robots.

03

Demonstrates feasibility of neural-compatible reinforcement learning for behavioral sequences.

Abstract

We introduce a dynamic neural algorithm called Dynamic Neural (DN) SARSA(\lambda) for learning a behavioral sequence from delayed reward. DN-SARSA(\lambda) combines Dynamic Field Theory models of behavioral sequence representation, classical reinforcement learning, and a computational neuroscience model of working memory, called Item and Order working memory, which serves as an eligibility trace. DN-SARSA(\lambda) is implemented on both a simulated and real robot that must learn a specific rewarding sequence of elementary behaviors from exploration. Results show DN-SARSA(\lambda) performs on the level of the discrete SARSA(\lambda), validating the feasibility of general reinforcement learning without compromising neural dynamics.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neural dynamics and brain function · EEG and Brain-Computer Interfaces