MatPIM: Accelerating Matrix Operations with Memristive Stateful Logic
Orian Leitersdorf, Ronny Ronen, and Shahar Kvatinsky

TL;DR
This paper introduces novel algorithms for memristive in-memory matrix operations, significantly accelerating matrix-vector multiplication and convolution for neural networks and image processing, leveraging memristive partitions and parallelism.
Contribution
It presents the first fast in-memory binary matrix-vector multiplication and convolution algorithms, overcoming previous limitations and achieving substantial speedups.
Findings
39x faster binary matrix-vector multiplication
2x faster convolution latency
First in-memory binary algorithms for these operations
Abstract
The emerging memristive Memory Processing Unit (mMPU) overcomes the memory wall through memristive devices that unite storage and logic for real processing-in-memory (PIM) systems. At the core of the mMPU is stateful logic, which is accelerated with memristive partitions to enable logic with massive inherent parallelism within crossbar arrays. This paper vastly accelerates the fundamental operations of matrix-vector multiplication and convolution in the mMPU, with either full-precision or binary elements. These proposed algorithms establish an efficient foundation for large-scale mMPU applications such as neural-networks, image processing, and numerical methods. We overcome the inherent asymmetry limitation in the previous in-memory full-precision matrix-vector multiplication solutions by utilizing techniques from block matrix multiplication and reduction. We present the first fast…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Reservoir Computing
