Fast Matrix Multiplication via Ternary Meta Flip Graphs
A. I. Perminov

TL;DR
This paper presents a GPU-accelerated method for discovering efficient matrix multiplication schemes with coefficients in {-1, 0, 1}, achieving new best ranks and expanding the understanding of ternary coefficient implementations.
Contribution
It introduces a novel meta flip graph algorithm that efficiently finds ternary matrix multiplication schemes, improving known ranks and providing open-source tools and results.
Findings
New best ranks for specific matrix formats.
Discovered 32 schemes matching known optimal ranks.
Achieved 30 rank improvements in the binary field.
Abstract
Matrix multiplication optimization remains a fundamental challenge in computational mathematics. This work introduces a novel approach that discovers matrix multiplication schemes whose coefficients are restricted to the set (denoted ), minimizing naive additive complexity for efficient hardware implementation. The core of the method is a GPU-accelerated meta flip graph algorithm that maintains ternary safety through specialized arithmetic operations and sign symmetry breaking. Key results include new best ranks for the formats , , and , the independent discovery of 32 schemes in that match known optimal ranks (including 8 previously known only with rational coefficients), and 30 rank improvements in the binary field. The analysis of 164 known schemes shows that 92 admit a ternary-coefficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPolynomial and algebraic computation · Cryptography and Residue Arithmetic · Tensor decomposition and applications
