Training Deep Spiking Neural Networks

Eimantas Ledinauskas (1); Julius Ruseckas (1); Alfonsas Jur\v{s}\.enas; (1); Giedrius Bura\v{c}as (2) ((1) Baltic Institute of Advanced Technology,; Lithuania; (2) SRI International; USA)

arXiv:2006.04436·cs.NE·June 9, 2020·26 cites

Training Deep Spiking Neural Networks

Eimantas Ledinauskas (1), Julius Ruseckas (1), Alfonsas Jur\v{s}\.enas, (1), Giedrius Bura\v{c}as (2) ((1) Baltic Institute of Advanced Technology,, Lithuania, (2) SRI International, USA)

PDF

Open Access

TL;DR

This paper demonstrates direct training of deep spiking neural networks using surrogate gradient backpropagation, addressing training challenges and achieving efficient inference with fewer time steps compared to converted models.

Contribution

It introduces methods to overcome gradient issues in deep SNNs and applies batch normalization, enabling training of ResNet50 on CIFAR100 and Imagenette datasets.

Findings

01

Deep SNNs can be trained directly with surrogate gradients.

02

Training SNNs with fewer inference steps is feasible.

03

Accuracy is slightly lower than ANNs but with significant efficiency gains.

Abstract

Computation using brain-inspired spiking neural networks (SNNs) with neuromorphic hardware may offer orders of magnitude higher energy efficiency compared to the current analog neural networks (ANNs). Unfortunately, training SNNs with the same number of layers as state of the art ANNs remains a challenge. To our knowledge the only method which is successful in this regard is supervised training of ANN and then converting it to SNN. In this work we directly train deep SNNs using backpropagation with surrogate gradient and find that due to implicitly recurrent nature of feed forward SNN's the exploding or vanishing gradient problem severely hinders their training. We show that this problem can be solved by tuning the surrogate gradient function. We also propose using batch normalization from ANN literature on input currents of SNN neurons. Using these improvements we show that is is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Neural dynamics and brain function · Ferroelectric and Negative Capacitance Devices

MethodsBatch Normalization