Sign Gradient Descent-based Neuronal Dynamics: ANN-to-SNN Conversion Beyond ReLU Network
Hyunseok Oh, Youngki Lee

TL;DR
This paper introduces a sign gradient descent-based neuronal dynamics framework that improves ANN-to-SNN conversion, enabling support for diverse nonlinearities and achieving state-of-the-art performance on large-scale datasets, including new DNN architectures.
Contribution
It provides a novel optimization-theoretic perspective on spiking neuron dynamics and extends ANN-to-SNN conversion capabilities beyond ReLU networks.
Findings
Achieves state-of-the-art ANN-to-SNN conversion performance.
Supports diverse nonlinearities beyond ReLU.
Successfully converts new DNN architectures like ConvNext and MLP-Mixer.
Abstract
Spiking neural network (SNN) is studied in multidisciplinary domains to (i) enable order-of-magnitudes energy-efficient AI inference and (ii) computationally simulate neuro-scientific mechanisms. The lack of discrete theory obstructs the practical application of SNN by limiting its performance and nonlinearity support. We present a new optimization-theoretic perspective of the discrete dynamics of spiking neurons. We prove that a discrete dynamical system of simple integrate-and-fire models approximates the sub-gradient method over unconstrained optimization problems. We practically extend our theory to introduce a novel sign gradient descent (signGD)-based neuronal dynamics that can (i) approximate diverse nonlinearities beyond ReLU and (ii) advance ANN-to-SNN conversion performance in low time steps. Experiments on large-scale datasets show that our technique achieves (i)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Average Pooling · Layer Normalization · Global Average Pooling · Affine Operator · Dropout · Dense Connections · Residual Connection · MLP-Mixer · Feedforward Network
