LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator
Guoyu Li (1, 2), Shengyu Ye (2), Chunyun Chen (3), Yang Wang (2),, Fan Yang (2), Ting Cao (2), Cheng Liu (1), Mohamed M. Sabry (3), Mao Yang (2), ((1) University of Chinese Academy of Sciences, (2) Microsoft Research, (3), NTU Singapore)

TL;DR
LUT-DLA introduces a look-up table framework utilizing vector quantization for extreme low-bit deep learning, significantly reducing computational and hardware costs while maintaining acceptable accuracy levels.
Contribution
It presents a novel LUT-based deep learning accelerator framework with multistage training and co-design optimization for enhanced efficiency.
Findings
Achieves 1.4-7.0x power efficiency improvements.
Attains 1.5-146.1x area efficiency gains.
Maintains modest accuracy drops across models.
Abstract
The emergence of neural network capabilities invariably leads to a significant surge in computational demands due to expanding model sizes and increased computational complexity. To reduce model size and lower inference costs, recent research has focused on simplifying models and designing hardware accelerators using low-bit quantization. However, due to numerical representation limits, scalar quantization cannot reduce bit width lower than 1-bit, diminishing its benefits. To break through these limitations, we introduce LUT-DLA, a Look-Up Table (LUT) Deep Learning Accelerator Framework that utilizes vector quantization to convert neural network models into LUTs, achieving extreme low-bit quantization. The LUT-DLA framework facilitates efficient and cost-effective hardware accelerator designs and supports the LUTBoost algorithm, which helps to transform various DNN models into LUT-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · CCD and CMOS Imaging Sensors · Medical Imaging Techniques and Applications
