Elucidating the theoretical underpinnings of surrogate gradient learning   in spiking neural networks

Julia Gygax; Friedemann Zenke

arXiv:2404.14964·cs.NE·November 19, 2024·1 cites

Elucidating the theoretical underpinnings of surrogate gradient learning in spiking neural networks

Julia Gygax, Friedemann Zenke

PDF

Open Access 1 Repo

TL;DR

This paper provides a theoretical foundation for surrogate gradient learning in spiking neural networks by relating it to stochastic automatic differentiation, supporting its practical effectiveness.

Contribution

It establishes a theoretical link between surrogate gradients and stochastic automatic differentiation in spiking neural networks, clarifying their validity and applicability.

Findings

01

Surrogate gradients are theoretically justified via stochastic automatic differentiation.

02

Empirical results confirm surrogate gradients' effectiveness in stochastic multi-layer networks.

03

Surrogate gradients are not generally derivatives of a surrogate loss, but are effective in practice.

Abstract

Training spiking neural networks to approximate universal functions is essential for studying information processing in the brain and for neuromorphic computing. Yet the binary nature of spikes poses a challenge for direct gradient-based training. Surrogate gradients have been empirically successful in circumventing this problem, but their theoretical foundation remains elusive. Here, we investigate the relation of surrogate gradients to two theoretically well-founded approaches. On the one hand, we consider smoothed probabilistic models, which, due to the lack of support for automatic differentiation, are impractical for training multi-layer spiking neural networks but provide derivatives equivalent to surrogate gradients for single neurons. On the other hand, we investigate stochastic automatic differentiation, which is compatible with discrete randomness but has not yet been used to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fmi-basel/surrogate-gradient-theory
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Advanced Memory and Neural Computing · Machine Learning and ELM