A Spiking Neural Network Learning Markov Chain
Mikhail Kiselev

TL;DR
This paper explores how spiking neural networks can learn and internalize the dynamics of an external environment modeled as a Markov chain, with applications to model-based reinforcement learning in continuous time.
Contribution
It introduces a novel SNN architecture with local plasticity rules capable of learning unknown Markov chain dynamics in continuous time.
Findings
SNN can learn Markov chain transition dynamics.
The model works in a continuous-time setting.
Demonstrated with a bouncing ball simulation.
Abstract
In this paper, the question how spiking neural network (SNN) learns and fixes in its internal structures a model of external world dynamics is explored. This question is important for implementation of the model-based reinforcement learning (RL), the realistic RL regime where the decisions made by SNN and their evaluation in terms of reward/punishment signals may be separated by significant time interval and sequence of intermediate evaluation-neutral world states. In the present work, I formalize world dynamics as a Markov chain with unknown a priori state transition probabilities, which should be learnt by the network. To make this problem formulation more realistic, I solve it in continuous time, so that duration of every state in the Markov chain may be different and is unknown. It is demonstrated how this task can be accomplished by an SNN with specially designed structure and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural dynamics and brain function · Neural Networks and Applications
