Efficient Floating-Point Givens Rotation Unit

Javier Hormigo; Sergio D. Muñoz

arXiv:2010.12376·cs.AR·October 26, 2020

TL;DR

This paper proposes an efficient high-throughput floating-point Givens rotation unit for QR decomposition on embedded systems, enhances it with the Half-Unit Biased format, and reports FPGA implementation results that improve on previous similar designs.

Contribution

It presents a high-throughput floating-point Givens rotation unit with a new Half-Unit Biased format, improving hardware efficiency for QR decomposition.

Findings

01. Significant reduction in area and latency compared to previous designs.

02. Enhanced throughput for embedded signal processing applications.

03. Effective error trade-offs demonstrated through analysis.

Abstract

High-throughput QR decomposition is a key operation in many advanced signal processing and communication applications. For some of these applications, floating-point computation is becoming almost compulsory. However, few works address hardware implementations of floating-point QR decomposition for embedded systems. In this paper, we propose a very efficient high-throughput floating-point Givens rotation unit for QR decomposition. Moreover, the initial design, proposed for conventional number formats, is enhanced by using the new Half-Unit Biased format. The provided error analysis shows the effectiveness of our proposals and the trade-offs among different implementation parameters. FPGA implementation results are also presented, along with a thorough comparison between both approaches. These implementation results also reveal outstanding improvements compared to other previous similar…
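
For orientation, the operation that such a unit accelerates can be sketched in software. The following is a minimal textbook sketch in Python of a Givens rotation and its use to triangularize a matrix; it is not the paper's hardware architecture and does not model the Half-Unit Biased format. The function names `givens`, `apply_givens`, and `qr_givens` are hypothetical and chosen only for illustration.

```python
import math

def givens(a, b):
    """Return (c, s, r) with c*a + s*b = r and -s*a + c*b = 0."""
    if b == 0.0:
        # Nothing to annihilate: use the identity rotation.
        return 1.0, 0.0, a
    r = math.hypot(a, b)  # sqrt(a*a + b*b) without intermediate overflow
    return a / r, b / r, r

def apply_givens(c, s, x, y):
    """Rotate the pair (x, y) by the rotation defined by (c, s)."""
    return c * x + s * y, -s * x + c * y

def qr_givens(A):
    """Return the upper-triangular factor R of A (a list of rows)."""
    m, n = len(A), len(A[0])
    R = [row[:] for row in A]  # work on a copy
    for j in range(n):
        # Zero column j from the bottom row up to the row below the diagonal.
        for i in range(m - 1, j, -1):
            c, s, _ = givens(R[i - 1][j], R[i][j])
            for k in range(j, n):
                R[i - 1][k], R[i][k] = apply_givens(c, s, R[i - 1][k], R[i][k])
    return R
```

Each rotation first derives `c` and `s` from one pair of elements and then applies the same rotation to the remaining pairs of the two affected rows. Only R is returned here; accumulating Q would require applying the same rotations to an identity matrix.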
