Biologically Plausible Variational Policy Gradient with Spiking   Recurrent Winner-Take-All Networks

Zhile Yang; Shangqi Guo; Ying Fang; Jian K. Liu

arXiv:2210.13225·cs.NE·October 25, 2022

Biologically Plausible Variational Policy Gradient with Spiking Recurrent Winner-Take-All Networks

Zhile Yang, Shangqi Guo, Ying Fang, Jian K. Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a biologically plausible reinforcement learning method called spiking variational policy gradient (SVPG) for spiking neural networks, derived from global policy gradients, improving robustness and eliminating heuristic local rules.

Contribution

The paper proposes a novel SVPG method for spiking recurrent networks, deriving local learning rules from global policy gradients, reducing heuristic design requirements.

Findings

01

SVPG achieves good training performance on MNIST and InvertedPendulum tasks.

02

SVPG demonstrates better robustness to noise compared to conventional methods.

03

The approach aligns local learning with global policy objectives.

Abstract

One stream of reinforcement learning research is exploring biologically plausible models and algorithms to simulate biological intelligence and fit neuromorphic hardware. Among them, reward-modulated spike-timing-dependent plasticity (R-STDP) is a recent branch with good potential in energy efficiency. However, current R-STDP methods rely on heuristic designs of local learning rules, thus requiring task-specific expert knowledge. In this paper, we consider a spiking recurrent winner-take-all network, and propose a new R-STDP method, spiking variational policy gradient (SVPG), whose local learning rules are derived from the global policy gradient and thus eliminate the need for heuristic designs. In experiments of MNIST classification and Gym InvertedPendulum, our SVPG achieves good training performance, and also presents better robustness to various kinds of noises than conventional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yzlc080733/bmvc2022_svpg
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Reservoir Computing

MethodsAdam · Stein Variational Policy Gradient