Block Format Error Bounds and Optimal Block Size Selection

Ilya Soloveychik; Ilya Lyubomirsky; Xin Wang; Sudeep Bhoja

arXiv:2210.05470·cs.LG·November 8, 2022

Block Format Error Bounds and Optimal Block Size Selection

Ilya Soloveychik, Ilya Lyubomirsky, Xin Wang, Sudeep Bhoja

PDF

Open Access

TL;DR

This paper rigorously analyzes block floating point formats, deriving error bounds and optimal block sizes to enhance numerical accuracy and hardware efficiency in deep neural network computations.

Contribution

It provides the first rigorous statistical analysis of BFP and SBFP formats, including error bounds and optimal block size determination.

Findings

01

Asymptotic error bounds for BFP and SBFP formats.

02

Optimal block size of 64 for 4-bit BFP.

03

Validated theoretical results with neural network weight data.

Abstract

The amounts of data that need to be transmitted, processed, and stored by the modern deep neural networks have reached truly enormous volumes in the last few years calling for the invention of new paradigms both in hardware and software development. One of the most promising and rapidly advancing frontiers here is the creation of new numerical formats. In this work we focus on the family of block floating point numerical formats due to their combination of wide dynamic range, numerical accuracy, and efficient hardware implementation of inner products using simple integer arithmetic. These formats are characterized by a block of mantissas with a shared scale factor. The basic Block Floating Point (BFP) format quantizes the block scales into the nearest powers of two on the right. Its simple modification - Scaled BFP (SBFP) - stores the same scales in full precision and thus allows higher…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNumerical Methods and Algorithms · Digital Filter Design and Implementation · Model Reduction and Neural Networks