Enhancing LUT-based Deep Neural Networks Inference through Architecture and Connectivity Optimization

Binglei Lou; Ruilin Wu; Philip Leong

arXiv:2601.09773·cs.AR·January 16, 2026

Enhancing LUT-based Deep Neural Networks Inference through Architecture and Connectivity Optimization

Binglei Lou, Ruilin Wu, Philip Leong

PDF

Open Access

TL;DR

This paper introduces SparseLUT, a framework that improves LUT-based DNN inference by reducing LUT size and optimizing neuron connectivity, leading to faster, more resource-efficient edge deployment without sacrificing accuracy.

Contribution

The paper presents architectural enhancements and a non-greedy training algorithm that together optimize LUT-based DNNs for resource-constrained devices, a novel combination in this context.

Findings

01

LUT consumption reduced by up to 13.9x

02

Inference latency decreased by up to 1.6x

03

Accuracy improved by up to 2.13% on MNIST

Abstract

Deploying deep neural networks (DNNs) on resource-constrained edge devices such as FPGAs requires a careful balance among latency, power, and hardware resource usage, while maintaining high accuracy. Existing Lookup Table (LUT)-based DNNs -- such as LogicNets, PolyLUT, and NeuraLUT -- face two critical challenges: the exponential growth of LUT size and inefficient random sparse connectivity. This paper presents SparseLUT, a comprehensive framework that addresses these challenges through two orthogonal optimizations. First, we propose an architectural enhancement that aggregates multiple PolyLUT sub-neurons via an adder, significantly reducing LUT consumption by 2.0x-13.9x and lowering inference latency by 1.2x-1.6x, all while maintaining comparable accuracy. Building upon this foundation, we further introduce a non-greedy training algorithm that optimizes neuron connectivity by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Big Data and Digital Economy