NullaNet Tiny: Ultra-low-latency DNN Inference Through Fixed-function Combinational Logic
Mahdi Nazemi, Arash Fayyazi, Amirhossein Esmaili, Atharva Khare,, Soheil Nazar Shahsavani, and Massoud Pedram

TL;DR
NullaNet Tiny introduces an FPGA-based neural network accelerator that replaces complex operations with Boolean logic, achieving ultra-low latency and high resource efficiency for applications demanding sub-microsecond response times.
Contribution
The paper presents a novel across-the-stack framework that maps neural network operations to FPGA LUTs, significantly reducing latency and resource usage compared to existing FPGA accelerators.
Findings
Achieves 2.36× lower latency than LogicNets.
Uses 24.42× fewer LUTs than LogicNets.
Maintains similar classification accuracy.
Abstract
While there is a large body of research on efficient processing of deep neural networks (DNNs), ultra-low-latency realization of these models for applications with stringent, sub-microsecond latency requirements continues to be an unresolved, challenging problem. Field-programmable gate array (FPGA)-based DNN accelerators are gaining traction as a serious contender to replace graphics processing unit/central processing unit-based platforms considering their performance, flexibility, and energy efficiency. This paper presents NullaNet Tiny, an across-the-stack design and optimization framework for constructing resource and energy-efficient, ultra-low-latency FPGA-based neural network accelerators. The key idea is to replace expensive operations required to compute various filter/neuron functions in a DNN with Boolean logic expressions that are mapped to the native look-up tables (LUTs)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Neural Network Applications · Advanced Memory and Neural Computing
