Take A Shortcut Back: Mitigating the Gradient Vanishing for Training   Spiking Neural Networks

Yufei Guo; Yuanpei Chen; Zecheng Hao; Weihang Peng; Zhou Jie; Yuhan; Zhang; Xiaode Liu; Zhe Ma

arXiv:2401.04486·cs.CV·October 1, 2024·1 cites

Take A Shortcut Back: Mitigating the Gradient Vanishing for Training Spiking Neural Networks

Yufei Guo, Yuanpei Chen, Zecheng Hao, Weihang Peng, Zhou Jie, Yuhan, Zhang, Xiaode Liu, Zhe Ma

PDF

Open Access

TL;DR

This paper introduces a shortcut back-propagation method for training Spiking Neural Networks, effectively mitigating gradient vanishing issues and improving performance without adding inference complexity.

Contribution

It proposes a novel gradient transmission technique and an evolutionary training framework to enhance SNN training and accuracy.

Findings

01

Outperforms state-of-the-art methods on various datasets

02

Effectively mitigates gradient vanishing in SNN training

03

Maintains low inference burden

Abstract

The Spiking Neural Network (SNN) is a biologically inspired neural network infrastructure that has recently garnered significant attention. It utilizes binary spike activations to transmit information, thereby replacing multiplications with additions and resulting in high energy efficiency. However, training an SNN directly poses a challenge due to the undefined gradient of the firing spike process. Although prior works have employed various surrogate gradient training methods that use an alternative function to replace the firing process during back-propagation, these approaches ignore an intrinsic problem: gradient vanishing. To address this issue, we propose a shortcut back-propagation method in our paper, which advocates for transmitting the gradient directly from the loss to the shallow layers. This enables us to present the gradient to the shallow layers directly, thereby…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Neural dynamics and brain function · Neural Networks and Reservoir Computing

MethodsSpiking Neural Networks