Scalable and RISC-V Programmable Near-Memory Computing Architectures for Edge Nodes
Michele Caon (1), Cl\'ement Chon\'e (2), Pasquale Davide Schiavone, (2), Alexandre Levisse (2), Guido Masera (1), Maurizio Martina (1), David, Atienza (2) ((1) Politecnico di Torino, (2) \'Ecole Polytechnique, F\'ed\'erale de Lausanne (EPFL))

TL;DR
This paper introduces scalable, RISC-V compatible near-memory computing architectures tailored for edge nodes, significantly enhancing energy efficiency and performance for AI/ML workloads in resource-constrained environments.
Contribution
It proposes a novel, flexible, and low-effort NMC architecture with two variants, NM-Caesar and NM-Carus, optimized for edge computing and integrated with RISC-V systems.
Findings
Up to 28x faster execution time compared to RISC-V CPU.
Up to 36x higher energy efficiency at system level.
NM-Carus achieves 306.7 GOPS/W in 8-bit matrix multiplications.
Abstract
The widespread adoption of data-centric algorithms, particularly Artificial Intelligence (AI) and Machine Learning (ML), has exposed the limitations of centralized processing infrastructures, driving a shift towards edge computing. This necessitates stringent constraints on energy efficiency, which traditional von Neumann architectures struggle to meet. The Compute-In-Memory (CIM) paradigm has emerged as a superior candidate due to its efficient exploitation of available memory bandwidth. However, existing CIM solutions require high implementation effort and lack flexibility from a software integration standpoint. This work proposes a novel, software-friendly, general-purpose, and low-integration-effort Near-Memory Computing (NMC) approach, paving the way for the adoption of CIM-based systems in the next generation of edge computing nodes. Two architectural variants, NM-Caesar and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing
