TL;DR
E3NE is an end-to-end FPGA framework that optimizes spiking neural network inference, achieving higher efficiency, lower power, and reduced latency compared to previous implementations, enabling deployment of large-scale models.
Contribution
It introduces a novel framework that automates SNN optimization on FPGAs using emerging neural encoding, improving efficiency and scalability over prior methods.
Findings
Uses less than 50% hardware resources compared to previous SNN implementations.
Reduces power consumption by 20%.
Reduces latency by an order of magnitude.
Abstract
Compiler frameworks are crucial for the widespread use of FPGA-based deep learning accelerators. They allow researchers and developers, who are not familiar with hardware engineering, to harness the performance attained by domain-specific logic. There exists a variety of frameworks for conventional artificial neural networks. However, not much research effort has been put into the creation of frameworks optimized for spiking neural networks (SNNs). This new generation of neural networks becomes increasingly interesting for the deployment of AI on edge devices, which have tight power and resource constraints. Our end-to-end framework E3NE automates the generation of efficient SNN inference logic for FPGAs. Based on a PyTorch model and user parameters, it applies various optimizations and assesses trade-offs inherent to spike-based accelerators. Multiple levels of parallelism and the use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution · Max Pooling · Dense Connections · Softmax · Dropout
