NullaNet Tiny: Ultra-low-latency DNN Inference Through Fixed-function   Combinational Logic

Mahdi Nazemi; Arash Fayyazi; Amirhossein Esmaili; Atharva Khare,; Soheil Nazar Shahsavani; and Massoud Pedram

arXiv:2104.05421·cs.LG·April 13, 2021

NullaNet Tiny: Ultra-low-latency DNN Inference Through Fixed-function Combinational Logic

Mahdi Nazemi, Arash Fayyazi, Amirhossein Esmaili, Atharva Khare,, Soheil Nazar Shahsavani, and Massoud Pedram

PDF

Open Access

TL;DR

NullaNet Tiny introduces an FPGA-based neural network accelerator that replaces complex operations with Boolean logic, achieving ultra-low latency and high resource efficiency for applications demanding sub-microsecond response times.

Contribution

The paper presents a novel across-the-stack framework that maps neural network operations to FPGA LUTs, significantly reducing latency and resource usage compared to existing FPGA accelerators.

Findings

01

Achieves 2.36× lower latency than LogicNets.

02

Uses 24.42× fewer LUTs than LogicNets.

03

Maintains similar classification accuracy.

Abstract

While there is a large body of research on efficient processing of deep neural networks (DNNs), ultra-low-latency realization of these models for applications with stringent, sub-microsecond latency requirements continues to be an unresolved, challenging problem. Field-programmable gate array (FPGA)-based DNN accelerators are gaining traction as a serious contender to replace graphics processing unit/central processing unit-based platforms considering their performance, flexibility, and energy efficiency. This paper presents NullaNet Tiny, an across-the-stack design and optimization framework for constructing resource and energy-efficient, ultra-low-latency FPGA-based neural network accelerators. The key idea is to replace expensive operations required to compute various filter/neuron functions in a DNN with Boolean logic expressions that are mapped to the native look-up tables (LUTs)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Advanced Neural Network Applications · Advanced Memory and Neural Computing