Learning to Control Dynamical Agents via Spiking Neural Networks and Metropolis-Hastings Sampling
Ali Safa, Farida Mohsen, Ali Al-Zawqari

TL;DR
This paper introduces a novel Bayesian sampling framework using Metropolis-Hastings to train Spiking Neural Networks for reinforcement learning control tasks, bypassing gradient-based methods and improving performance.
Contribution
It presents the first MH-based training method for SNNs in RL, enabling direct optimization without backpropagation and demonstrating superior results on control benchmarks.
Findings
Outperforms DQL baselines in reward maximization
Reduces training episodes and network resources
Effective for neuromorphic hardware deployment
Abstract
Spiking Neural Networks (SNNs) offer biologically inspired, energy-efficient alternatives to traditional Deep Neural Networks (DNNs) for real-time control systems. However, their training presents several challenges, particularly for reinforcement learning (RL) tasks, due to the non-differentiable nature of spike-based communication. In this work, we introduce what is, to our knowledge, the first framework that employs Metropolis-Hastings (MH) sampling, a Bayesian inference technique, to train SNNs for dynamical agent control in RL environments without relying on gradient-based methods. Our approach iteratively proposes and probabilistically accepts network parameter updates based on accumulated reward signals, effectively circumventing the limitations of backpropagation while enabling direct optimization on neuromorphic platforms. We evaluated this framework on two standard control…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural dynamics and brain function · Neural Networks and Applications
