Accelerating Scientific Computations with Mixed Precision Algorithms

Marc Baboulin; Alfredo Buttari; Jack Dongarra; Jakub Kurzak; Julie; Langou; Julien Langou; Piotr Luszczek; and Stanimire Tomov

arXiv:0808.2794·cs.MS·May 13, 2015

Accelerating Scientific Computations with Mixed Precision Algorithms

Marc Baboulin, Alfredo Buttari, Jack Dongarra, Jakub Kurzak, Julie, Langou, Julien Langou, Piotr Luszczek, and Stanimire Tomov

PDF

TL;DR

This paper explores mixed precision algorithms that combine 32-bit and 64-bit floating point operations to accelerate scientific computations across various modern hardware architectures while preserving solution accuracy.

Contribution

It introduces a novel approach to enhance computational performance using mixed precision arithmetic applicable to diverse hardware platforms.

Findings

01

Performance gains with mixed precision algorithms

02

Maintained 64-bit solution accuracy

03

Effective on CPUs, GPUs, FPGAs, and STI Cell BE

Abstract

On modern architectures, the performance of 32-bit operations is often at least twice as fast as the performance of 64-bit operations. By using a combination of 32-bit and 64-bit floating point arithmetic, the performance of many dense and sparse linear algebra algorithms can be significantly enhanced while maintaining the 64-bit accuracy of the resulting solution. The approach presented here can apply not only to conventional processors but also to other technologies such as Field Programmable Gate Arrays (FPGA), Graphical Processing Units (GPU), and the STI Cell BE processor. Results on modern processor architectures and the STI Cell BE are presented.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.