DeepBurning-MixQ: An Open Source Mixed-Precision Neural Network Accelerator Design Framework for FPGAs

Erjing Luo; Haitong Huang; Cheng Liu; Guoyu Li; Bing Yang; Ying Wang; Huawei Li; Xiaowei Li

arXiv:2308.11334 · cs.AR · August 23, 2023 · 1 citation

TL;DR

This paper introduces DeepBurning-MixQ, an open-source FPGA accelerator design framework for mixed-precision neural networks that optimizes DSP packing and quantization, significantly improving performance and accuracy.

Contribution

The paper presents a systematic DSP-packing algorithm and a unified neural architecture search (NAS) framework tailored for mixed-precision neural networks (MPNNs) on FPGAs, enabling efficient resource utilization and higher accuracy.

Findings

1. Achieves superior performance over handcrafted accelerators.

2. Improves resource utilization and inference accuracy.

3. Effectively supports mixed-precision neural networks on FPGAs.

Abstract

Mixed-precision neural networks (MPNNs), which use just enough data width for a deep learning task, promise significant advantages in both inference accuracy and computing overhead. FPGAs, with their fine-grained reconfiguration capability, can adapt the processing to distinct data widths and models and hence can, in theory, unleash the potential of MPNNs. Nevertheless, commodity DPUs on FPGAs mostly emphasize generality and have limited support for MPNNs, especially those with lower data widths. In addition, primitive DSPs in FPGAs usually have much larger data widths than MPNNs require and have not yet been sufficiently co-explored with MPNNs. To this end, we propose an open-source MPNN accelerator design framework specifically tailored for FPGAs. In this framework, we have a systematic DSP-packing algorithm to pack multiple lower data width MACs in a single…
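The idea of packing several low-bit multiplications into one wide multiplier can be illustrated with a minimal sketch. This is a generic illustration of the packing principle, not the paper's algorithm: it assumes unsigned operands of equal width and a shared activation, and the helper name `pack_two_products` is hypothetical. Signed operands and accumulation across several MACs need extra guard bits and correction terms, which are omitted here.

```python
from typing import Tuple


def pack_two_products(a: int, w1: int, w2: int, bits: int = 4) -> Tuple[int, int]:
    """Compute a*w1 and a*w2 with a single wide multiplication.

    Assumption (not from the paper): all operands are unsigned
    integers in the range [0, 2**bits).
    """
    limit = 1 << bits
    for value in (a, w1, w2):
        if not 0 <= value < limit:
            raise ValueError(f"operand {value} does not fit in {bits} unsigned bits")

    # Each partial product needs at most 2*bits bits, so placing w2
    # 2*bits positions higher keeps the two products from overlapping.
    shift = 2 * bits
    packed = w1 | (w2 << shift)

    # The single wide multiplication that a DSP block would perform.
    product = a * packed

    low = product & ((1 << shift) - 1)   # a * w1
    high = product >> shift              # a * w2
    return low, high


if __name__ == "__main__":
    print(pack_two_products(3, 5, 7))
```

For 4-bit operands the packed weight occupies 12 bits and the full product 16 bits, which fits comfortably within the multiplier width of typical FPGA DSP blocks; this unused headroom is what packing exploits.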

Code & Models

Repositories

fffasttime/anypackingnet (PyTorch, official)

Taxonomy

Topics: Advanced Neural Network Applications · Neural Networks and Applications · Machine Learning and ELM