TL;DR
MultPIM introduces a novel in-memory multiplication algorithm leveraging stateful logic and partition-based techniques, achieving significant latency reductions and efficiency improvements for 32-bit and matrix-vector multiplications.
Contribution
It proposes a new in-memory multiplication method with a carry-save add-shift technique and an improved full-adder, reducing complexity and latency compared to prior algorithms.
Findings
Reduces multiplication time complexity from quadratic to linear-log
Achieves 4.2x latency improvement over RIME for 32-bit numbers
Improves matrix-vector multiplication latency by 25.5x over FloatPIM
Abstract
Processing-in-memory (PIM) seeks to eliminate computation/memory data transfer using devices that support both storage and logic. Stateful logic techniques such as IMPLY, MAGIC and FELIX can perform logic gates within memristive crossbar arrays with massive parallelism. Multiplication via stateful logic is an active field of research due to the wide implications. Recently, RIME has become the state-of-the-art algorithm for stateful single-row multiplication by using memristive partitions, reducing the latency of the previous state-of-the-art by 5.1x. In this paper, we begin by proposing novel partition-based computation techniques for broadcasting and shifting data. Then, we design an in-memory multiplication algorithm based on the carry-save add-shift (CSAS) technique. Finally, we develop a novel stateful full-adder that significantly improves the state-of-the-art (FELIX) design. These…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
