Operand Folding Hardware Multipliers
Byungchun Chung, Sandra Marcello, Amir-Pasha Mirbaha, David, Naccache, Karim Sabeg

TL;DR
This paper introduces a novel accumulate-and-add multiplication algorithm that partitions operands to achieve a more compact and faster hardware multiplier, significantly reducing the number of additions needed for large bit-lengths.
Contribution
The paper presents a new operand partitioning method for accumulate-and-add multiplication, reducing addition count and improving hardware efficiency for large operands.
Findings
Requires only 0.194m+56 additions for 1024-bit operands
Approximately halves the additions compared to classical methods
Results in more compact and faster hardware multipliers
Abstract
This paper describes a new accumulate-and-add multiplication algorithm. The method partitions one of the operands and re-combines the results of computations done with each of the partitions. The resulting design turns-out to be both compact and fast. When the operands' bit-length is 1024, the new algorithm requires only additions (on average), this is about half the number of additions required by the classical accumulate-and-add multiplication algorithm ().
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical Methods and Algorithms · Low-power high-performance VLSI design · Coding theory and cryptography
