Error Analysis of Matrix Multiplication Emulation Using Ozaki-II Scheme
Yuki Uchino, Katsuhisa Ozaki, Toshiyuki Imamura

TL;DR
This paper provides a detailed error analysis of the Ozaki-II matrix multiplication emulation scheme, clarifying its accuracy behavior and estimating the number of low-precision multiplications needed for high-precision results.
Contribution
It offers the first rigorous deterministic error analysis of the Ozaki-II scheme, enhancing understanding of its accuracy and efficiency.
Findings
Error behavior depends on input exponent distribution.
The analysis enables estimation of required low-precision multiplications.
High throughput achieved with INT8 hardware can be affected by accuracy degradation.
Abstract
The Ozaki-II scheme is an emulation method that leverages the Chinese Remainder Theorem to compute high-precision matrix multiplication via a sequence of low-precision matrix multiplications. In this scheme, the attainable numerical accuracy improves as the number of low-precision matrix multiplications increases. Previous numerical studies have shown that single- and double-precision matrix multiplication using the Ozaki-II scheme achieves higher throughput than that of standard BLAS routines on modern AI hardware equipped with fast INT8 matrix multiply-accumulate units with INT8 inputs and INT32 accumulation. However, the accuracy of the Ozaki-II scheme can degrade when the exponent distribution of the input matrices is wide, in which case a large number of low-precision matrix multiplications is required to obtain high-precision results. In this paper, we present a rigorous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical Methods and Algorithms · Cryptography and Residue Arithmetic · Parallel Computing and Optimization Techniques
