Benchmarking and modeling of analog and digital SRAM in-memory computing architectures
Pouya Houshmand, Jiacong Sun, Marian Verhelst

TL;DR
This paper compares analog and digital in-memory computing architectures for neural network acceleration, providing benchmarking, an analytical cost model, and workload scheduling insights to guide future hardware design.
Contribution
It offers a comprehensive comparison framework including architecture overview, cost modeling, and workload scheduling analysis for AIMC and DIMC.
Findings
Digital IMC generally offers higher accuracy.
Analog IMC can be more energy-efficient.
Workload scheduling impacts overall network efficiency.
Abstract
In-memory-computing is emerging as an efficient hardware paradigm for deep neural network accelerators at the edge, enabling to break the memory wall and exploit massive computational parallelism. Two design models have surged: analog in-memory-computing (AIMC) and digital in-memory-computing (DIMC), offering a different design space in terms of accuracy, efficiency and dataflow flexibility. This paper targets the fair comparison and benchmarking of both approaches to guide future designs, through a.) an overview of published architectures; b.) an analytical cost model for energy and throughput; c.) scheduling of workloads on a variety of modeled IMC architectures for end-to-end network efficiency analysis, offering valuable workload-hardware co-design insights.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Semiconductor materials and devices
