Near-Precise Parameter Approximation for Multiple Multiplications on A   Single DSP Block

Ercan Kalali; Rene van Leuken

arXiv:2104.02162·cs.AR·October 26, 2021

Near-Precise Parameter Approximation for Multiple Multiplications on A Single DSP Block

Ercan Kalali, Rene van Leuken

PDF

TL;DR

This paper introduces a novel approximation technique for MAC operations in FPGA DSP blocks, enabling efficient multiple parameter multiplications with minimal accuracy loss and significant hardware resource savings.

Contribution

It proposes a Single DSP - Multiple Multiplication (SDMM) method that separates multiplication and accumulation, improving efficiency and enabling high compression rates in CNN implementations.

Findings

01

Achieves up to 33% parameter compression without hardware cost.

02

Reduces DSP block usage by up to 83.3% in FPGA implementations.

03

Maintains accuracy in CNNs with minimal loss across various precisions.

Abstract

A multiply-accumulate (MAC) operation is the main computation unit for DSP applications. DSP blocks are one of the efficient solutions to implement MACs in FPGA's. However, since the DSP blocks have wide multiplier and adder blocks, MAC operations using low bit-length parameters lead to an underutilization problem. Hence, an efficient approximation technique is introduced. The technique includes manipulation and approximation of the low bit-length fixed-point parameters based upon a Single DSP - Multiple Multiplication (SDMM) execution. The SDMM changes the traditional MAC implementation in the DSP block by separating multiplication and accumulation operations. While the accumulator hardware available in the DSP block is used for multiple parameter multiplication, parallel LUTs are employed for the accumulation part of the MAC operation. The accuracy of the developed optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.