Towards Faster Feasible Matrix Multiplication by Trilinear Aggregation
Oded Schwartz, Eyal Zwecher

TL;DR
This paper introduces a new fast matrix multiplication algorithm using trilinear aggregation, improving asymptotic complexity and practical applicability for small to medium-sized matrices.
Contribution
The authors develop an improved algorithm with a lower exponent and better base case complexity, advancing the practical efficiency of fast matrix multiplication methods.
Findings
Achieved an $O(n^{2.773203})$ algorithm applicable to feasible matrix sizes.
Optimized the algorithm for small base cases starting at $n_0=28$.
Reduced additive complexity through sparse decomposition and coefficient minimization.
Abstract
Matrix multiplication is a fundamental kernel in high performance computing. Many algorithms for fast matrix multiplication can only be applied to enormous matrices () and thus cannot be used in practice. Of all algorithms applicable to feasible input, Pan's algorithm (1982) is asymptotically the fastest. We obtain an algorithm applicable to the same input sizes as Pan's algorithm. This algorithm is the fastest matrix multiplication algorithm with base case smaller than . Further, our method obtains the best asymptotic complexity for many small base cases, starting at . We also obtain better exponents for larger base cases. To construct our algorithm, we use the trilinear aggregation method. We find parts of the algorithms that are equivalent to matrix multiplication with smaller base case, and use the de Groote equivalence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Complexity and Algorithms in Graphs · Tensor decomposition and applications
