PolyLUT-Add: FPGA-based LUT Inference with Wide Inputs
Binglei Lou, Richard Rademacher, David Boland, Philip H.W. Leong

TL;DR
PolyLUT-Add enhances FPGA-based LUT neural networks by combining sub-neurons through addition, significantly reducing LUT resource usage and latency while maintaining accuracy across various benchmarks.
Contribution
It introduces PolyLUT-Add, a novel method for increasing neuron connectivity in LUT networks, along with a scalable architecture for improved FPGA deployment.
Findings
Achieves 2.0-13.9x LUT reduction for similar accuracy.
Reduces latency by 1.2-1.6x.
Effective across multiple benchmarks.
Abstract
FPGAs have distinct advantages as a technology for deploying deep neural networks (DNNs) at the edge. Lookup Table (LUT) based networks, where neurons are directly modeled using LUTs, help maximize this promise of offering ultra-low latency and high area efficiency on FPGAs. Unfortunately, LUT resource usage scales exponentially with the number of inputs to the LUT, restricting PolyLUT to small LUT sizes. This work introduces PolyLUT-Add, a technique that enhances neuron connectivity by combining PolyLUT sub-neurons via addition to improve accuracy. Moreover, we describe a novel architecture to improve its scalability. We evaluated our implementation over the MNIST, Jet Substructure classification, and Network Intrusion Detection benchmark and found that for similar accuracy, PolyLUT-Add achieves a LUT reduction of with a decrease in latency.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReal-time simulation and control systems
