Towards automated generation of fast and accurate algorithms for recursive matrix multiplication
Jean-Guillaume Dumas (CASC), Cl\'ement Pernet (CASC), Alexandre Sedoglavic (CRIStAL)

TL;DR
This paper introduces a novel strategy for automatically generating fast, accurate recursive matrix multiplication algorithms by optimizing stability bounds, operation count, and basis sparsification, exemplified by improved 2x2 matrix multiplication algorithms.
Contribution
It presents a comprehensive, three-phase strategy for designing recursive matrix multiplication algorithms that optimize stability, efficiency, and accuracy, including a new 2x2 algorithm with better practical accuracy.
Findings
Proposed a 2x2 matrix multiplication algorithm with 7 products outperforming previous methods.
Developed heuristics that reduce operation count while maintaining accuracy.
Applied basis sparsification to improve time complexity constants.
Abstract
We propose a strategy for the generation of fast and accurate versions of non-commutative recursive matrix multiplication algorithms. To generate these algorithms, we consider matrix and tensor norm bounds governing the stability and accuracy of numerical matrix multiplication. We start by a unification on known max-norm bounds on matrix multiplication stability and then extend them to further norms and more generally to recursive bilinear algorithms and the alternative basis matrix multiplication algorithms. Then our strategy has three phases. First, we reduce those bounds by minimizing a growth factor along the orbits of the associated matrix multiplication tensor decomposition. Second, we develop heuristics that minimize the number of operations required to realize a bilinear formula, while further improving its accuracy. Third, we perform an alternative basis sparsification that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Parallel Computing and Optimization Techniques · Matrix Theory and Algorithms
