A Neuromorphic Architecture for Reinforcement Learning from Real-Valued   Observations

Sergio F. Chevtchenko; Yeshwanth Bethi; Teresa B. Ludermir; Saeed; Afshar

arXiv:2307.02947·cs.NE·August 9, 2023·1 cites

A Neuromorphic Architecture for Reinforcement Learning from Real-Valued Observations

Sergio F. Chevtchenko, Yeshwanth Bethi, Teresa B. Ludermir, Saeed, Afshar

PDF

Open Access

TL;DR

This paper introduces a neuromorphic SNN architecture for reinforcement learning with real-valued observations, demonstrating hardware-efficient online learning and superior performance on classic control tasks.

Contribution

It presents a novel bio-inspired SNN model with TD-error modulation and eligibility traces, advancing hardware-efficient RL implementations.

Findings

01

Outperforms tabular actor-critic in RL benchmarks

02

Successfully learns stable policies in classic control environments

03

Offers a hardware-friendly online learning approach

Abstract

Reinforcement Learning (RL) provides a powerful framework for decision-making in complex environments. However, implementing RL in hardware-efficient and bio-inspired ways remains a challenge. This paper presents a novel Spiking Neural Network (SNN) architecture for solving RL problems with real-valued observations. The proposed model incorporates multi-layered event-based clustering, with the addition of Temporal Difference (TD)-error modulation and eligibility traces, building upon prior work. An ablation study confirms the significant impact of these components on the proposed model's performance. A tabular actor-critic algorithm with eligibility traces and a state-of-the-art Proximal Policy Optimization (PPO) algorithm are used as benchmarks. Our network consistently outperforms the tabular approach and successfully discovers stable control policies on classic RL environments:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Neural Networks and Reservoir Computing · Reinforcement Learning in Robotics