Elucidating the theoretical underpinnings of surrogate gradient learning in spiking neural networks
Julia Gygax, Friedemann Zenke

TL;DR
This paper provides a theoretical foundation for surrogate gradient learning in spiking neural networks by relating it to stochastic automatic differentiation, supporting its practical effectiveness.
Contribution
It establishes a theoretical link between surrogate gradients and stochastic automatic differentiation in spiking neural networks, clarifying their validity and applicability.
Findings
Surrogate gradients are theoretically justified via stochastic automatic differentiation.
Empirical results confirm surrogate gradients' effectiveness in stochastic multi-layer networks.
Surrogate gradients are not generally derivatives of a surrogate loss, but are effective in practice.
Abstract
Training spiking neural networks to approximate universal functions is essential for studying information processing in the brain and for neuromorphic computing. Yet the binary nature of spikes poses a challenge for direct gradient-based training. Surrogate gradients have been empirically successful in circumventing this problem, but their theoretical foundation remains elusive. Here, we investigate the relation of surrogate gradients to two theoretically well-founded approaches. On the one hand, we consider smoothed probabilistic models, which, due to the lack of support for automatic differentiation, are impractical for training multi-layer spiking neural networks but provide derivatives equivalent to surrogate gradients for single neurons. On the other hand, we investigate stochastic automatic differentiation, which is compatible with discrete randomness but has not yet been used to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Memory and Neural Computing · Machine Learning and ELM
