Efficient Non-sequential Division for FPGAs

Michael Lunglmayr

arXiv:2105.05747·eess.SP·September 12, 2022

Efficient Non-sequential Division for FPGAs

Michael Lunglmayr

PDF

Open Access

TL;DR

This paper introduces an FPGA-specific method for fast, low-complexity division that approximates the reciprocal with high precision in a single clock cycle, suitable for machine learning accelerators.

Contribution

It presents a novel non-sequential division approach for FPGAs using a low complexity reciprocal approximation and correction, enabling high-speed, low-resource division operations.

Findings

01

Achieves division in one clock cycle with low hardware cost

02

Provides arbitrarily low approximation errors

03

Demonstrates superior speed and resource efficiency compared to existing methods

Abstract

The division operation is important for many areas of data processing. Especially considering today's demand for hardware accelerators for machine learning algorithms, there is a high demand for an efficient calculation of the division function, e.g. for averaging operations or the online calculation of activation functions. For such algorithms, which are often iterative in nature, one would like to have a non-sequential way of calculating the division operation. The work presents such an approach especially tailored to FPGAs as processing platforms. It is based on an efficient way of calculating the reciprocal operation, based on a low complexity approximation combined with a correction function. The described approach allows approximating the division operation (with errors that can be made arbitrarily low), within one clock cycle using only low hardware requirements. These hardware…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNumerical Methods and Algorithms · Low-power high-performance VLSI design · Digital Filter Design and Implementation