E3NE: An End-to-End Framework for Accelerating Spiking Neural Networks   with Emerging Neural Encoding on FPGAs

Daniel Gerlinghoff; Zhehui Wang; Xiaozhe Gu; Rick Siow Mong Goh; Tao; Luo

arXiv:2111.10027·cs.NE·June 7, 2022

E3NE: An End-to-End Framework for Accelerating Spiking Neural Networks with Emerging Neural Encoding on FPGAs

Daniel Gerlinghoff, Zhehui Wang, Xiaozhe Gu, Rick Siow Mong Goh, Tao, Luo

PDF

1 Repo

TL;DR

E3NE is an end-to-end FPGA framework that optimizes spiking neural network inference, achieving higher efficiency, lower power, and reduced latency compared to previous implementations, enabling deployment of large-scale models.

Contribution

It introduces a novel framework that automates SNN optimization on FPGAs using emerging neural encoding, improving efficiency and scalability over prior methods.

Findings

01

Uses less than 50% hardware resources compared to previous SNN implementations.

02

Reduces power consumption by 20%.

03

Reduces latency by an order of magnitude.

Abstract

Compiler frameworks are crucial for the widespread use of FPGA-based deep learning accelerators. They allow researchers and developers, who are not familiar with hardware engineering, to harness the performance attained by domain-specific logic. There exists a variety of frameworks for conventional artificial neural networks. However, not much research effort has been put into the creation of frameworks optimized for spiking neural networks (SNNs). This new generation of neural networks becomes increasingly interesting for the deployment of AI on edge devices, which have tight power and resource constraints. Our end-to-end framework E3NE automates the generation of efficient SNN inference logic for FPGAs. Based on a PyTorch model and user parameters, it applies various optimizations and assesses trade-offs inherent to spike-based accelerators. Multiple levels of parallelism and the use…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

danielgerlinghoff/radix-encoding
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution · Max Pooling · Dense Connections · Softmax · Dropout