PolyLUT-Add: FPGA-based LUT Inference with Wide Inputs

Binglei Lou; Richard Rademacher; David Boland; Philip H.W. Leong

arXiv:2406.04910·cs.LG·September 17, 2024

PolyLUT-Add: FPGA-based LUT Inference with Wide Inputs

Binglei Lou, Richard Rademacher, David Boland, Philip H.W. Leong

PDF

Open Access 1 Repo

TL;DR

PolyLUT-Add enhances FPGA-based LUT neural networks by combining sub-neurons through addition, significantly reducing LUT resource usage and latency while maintaining accuracy across various benchmarks.

Contribution

It introduces PolyLUT-Add, a novel method for increasing neuron connectivity in LUT networks, along with a scalable architecture for improved FPGA deployment.

Findings

01

Achieves 2.0-13.9x LUT reduction for similar accuracy.

02

Reduces latency by 1.2-1.6x.

03

Effective across multiple benchmarks.

Abstract

FPGAs have distinct advantages as a technology for deploying deep neural networks (DNNs) at the edge. Lookup Table (LUT) based networks, where neurons are directly modeled using LUTs, help maximize this promise of offering ultra-low latency and high area efficiency on FPGAs. Unfortunately, LUT resource usage scales exponentially with the number of inputs to the LUT, restricting PolyLUT to small LUT sizes. This work introduces PolyLUT-Add, a technique that enhances neuron connectivity by combining $A$ PolyLUT sub-neurons via addition to improve accuracy. Moreover, we describe a novel architecture to improve its scalability. We evaluated our implementation over the MNIST, Jet Substructure classification, and Network Intrusion Detection benchmark and found that for similar accuracy, PolyLUT-Add achieves a LUT reduction of $2.0 - 13.9 \times$ with a $1.2 - 1.6 \times$ decrease in latency.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bingleilou/PolyLUT-Add
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReal-time simulation and control systems