Block Format Error Bounds and Optimal Block Size Selection
Ilya Soloveychik, Ilya Lyubomirsky, Xin Wang, Sudeep Bhoja

TL;DR
This paper rigorously analyzes block floating point formats, deriving error bounds and optimal block sizes to enhance numerical accuracy and hardware efficiency in deep neural network computations.
Contribution
It provides the first rigorous statistical analysis of BFP and SBFP formats, including error bounds and optimal block size determination.
Findings
Asymptotic error bounds for BFP and SBFP formats.
Optimal block size of 64 for 4-bit BFP.
Validated theoretical results with neural network weight data.
Abstract
The amounts of data that need to be transmitted, processed, and stored by the modern deep neural networks have reached truly enormous volumes in the last few years calling for the invention of new paradigms both in hardware and software development. One of the most promising and rapidly advancing frontiers here is the creation of new numerical formats. In this work we focus on the family of block floating point numerical formats due to their combination of wide dynamic range, numerical accuracy, and efficient hardware implementation of inner products using simple integer arithmetic. These formats are characterized by a block of mantissas with a shared scale factor. The basic Block Floating Point (BFP) format quantizes the block scales into the nearest powers of two on the right. Its simple modification - Scaled BFP (SBFP) - stores the same scales in full precision and thus allows higher…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical Methods and Algorithms · Digital Filter Design and Implementation · Model Reduction and Neural Networks
