Enhancing LUT-based Deep Neural Networks Inference through Architecture and Connectivity Optimization
Binglei Lou, Ruilin Wu, Philip Leong

TL;DR
This paper introduces SparseLUT, a framework that improves LUT-based DNN inference by reducing LUT size and optimizing neuron connectivity, leading to faster, more resource-efficient edge deployment without sacrificing accuracy.
Contribution
The paper presents architectural enhancements and a non-greedy training algorithm that together optimize LUT-based DNNs for resource-constrained devices, a novel combination in this context.
Findings
LUT consumption reduced by up to 13.9x
Inference latency decreased by up to 1.6x
Accuracy improved by up to 2.13% on MNIST
Abstract
Deploying deep neural networks (DNNs) on resource-constrained edge devices such as FPGAs requires a careful balance among latency, power, and hardware resource usage, while maintaining high accuracy. Existing Lookup Table (LUT)-based DNNs -- such as LogicNets, PolyLUT, and NeuraLUT -- face two critical challenges: the exponential growth of LUT size and inefficient random sparse connectivity. This paper presents SparseLUT, a comprehensive framework that addresses these challenges through two orthogonal optimizations. First, we propose an architectural enhancement that aggregates multiple PolyLUT sub-neurons via an adder, significantly reducing LUT consumption by 2.0x-13.9x and lowering inference latency by 1.2x-1.6x, all while maintaining comparable accuracy. Building upon this foundation, we further introduce a non-greedy training algorithm that optimizes neuron connectivity by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Big Data and Digital Economy
