On the precision attainable with various floating-point number systems

Richard P. Brent

arXiv:1004.3374·cs.NA·April 21, 2010

On the precision attainable with various floating-point number systems

Richard P. Brent

PDF

Open Access

TL;DR

This paper compares the numerical accuracy of various floating-point number systems, analyzing how choices like base and normalization affect precision in scientific computations through theoretical and simulation results.

Contribution

It provides a comparative analysis of floating-point systems, highlighting optimal base choices for minimizing roundoff errors based on theoretical and simulation evidence.

Findings

01

Base 2 minimizes mean square roundoff error when leading fraction bit is not stored.

02

Base 4 or 8 are optimal when the first bit must be stored explicitly.

03

Theoretical and simulation results support the choice of base for accuracy.

Abstract

For scientific computations on a digital computer the set of real number is usually approximated by a finite set F of "floating-point" numbers. We compare the numerical accuracy possible with difference choices of F having approximately the same range and requiring the same word length. In particular, we compare different choices of base (or radix) in the usual floating-point systems. The emphasis is on the choice of F, not on the details of the number representation or the arithmetic, but both rounded and truncated arithmetic are considered. Theoretical results are given, and some simulations of typical floating-point computations (forming sums, solving systems of linear equations, finding eigenvalues) are described. If the leading fraction bit of a normalized base 2 number is not stored explicitly (saving a bit), and the criterion is to minimize the mean square roundoff error, then…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNumerical Methods and Algorithms · Digital Filter Design and Implementation · Algorithms and Data Compression