Fair and Square: Replacing One Real Multiplication with a Single Square and One Complex Multiplication with Three Squares When Performing Matrix Multiplication and Convolutions
Vincenzo Liguori

TL;DR
This paper introduces methods to replace real and complex multiplications with fewer squaring operations in matrix multiplications and convolutions, significantly reducing hardware resource requirements.
Contribution
It presents novel techniques to replace multiplications with squarings, leading to resource-efficient hardware architectures for matrix operations and convolutions.
Findings
Single real multiplication replaced with one square
Single complex multiplication replaced with three squares
Hardware architectures demonstrate resource savings
Abstract
This paper shows that, for matrix multiplications and convolutions, it is possible to asymptotically replace each real multiplication with a single squaring operation. Similarly, a single complex multiplication can be replaced with 3 squaring operations. Given that an n bits squaring circuit requires about half the gate count of an nxn multiplier, this results in large resource reductions. With some caveats, the same techniques can apply to dot products, transformations and more. A varietiy of architectures implementing these ideas in hardware are described, including square based systolic arrays and tensor cores.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCryptography and Residue Arithmetic · Low-power high-performance VLSI design · Numerical Methods and Algorithms
