Deep Residual Learning in Spiking Neural Networks
Wei Fang, Zhaofei Yu, Yanqi Chen, Tiejun Huang, Timoth\'ee Masquelier,, Yonghong Tian

TL;DR
This paper introduces the spike-element-wise (SEW) ResNet, a novel residual learning architecture for deep Spiking Neural Networks that overcomes previous training limitations, enabling the training of networks with over 100 layers and achieving superior accuracy.
Contribution
The paper proposes the SEW ResNet architecture for deep SNNs, demonstrating its ability to implement residual learning and train very deep networks effectively.
Findings
SEW ResNet outperforms state-of-the-art SNNs in accuracy and efficiency.
Enables training of SNNs with more than 100 layers.
Achieves higher performance by simply adding more layers.
Abstract
Deep Spiking Neural Networks (SNNs) present optimization difficulties for gradient-based approaches due to discrete binary activation and complex spatial-temporal dynamics. Considering the huge success of ResNet in deep learning, it would be natural to train deep SNNs with residual learning. Previous Spiking ResNet mimics the standard residual block in ANNs and simply replaces ReLU activation layers with spiking neurons, which suffers the degradation problem and can hardly implement residual learning. In this paper, we propose the spike-element-wise (SEW) ResNet to realize residual learning in deep SNNs. We prove that the SEW ResNet can easily implement identity mapping and overcome the vanishing/exploding gradient problems of Spiking ResNet. We evaluate our SEW ResNet on ImageNet, DVS Gesture, and CIFAR10-DVS datasets, and show that SEW ResNet outperforms the state-of-the-art directly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural dynamics and brain function
MethodsAverage Pooling · 1x1 Convolution · Batch Normalization · Kaiming Initialization · Global Average Pooling · Residual Connection · Bottleneck Residual Block · *Communicated@Fast*How Do I Communicate to Expedia? · Convolution · Residual Block
