An Event-Based Digital Compute-In-Memory Accelerator with Flexible Operand Resolution and Layer-Wise Weight/Output Stationarity
Nicolas Chauvaux, Adrian Kneip, Christoph Posch, Kofi Makinwa and, Charlotte Frenkel

TL;DR
This paper introduces a flexible digital compute-in-memory accelerator for spiking neural networks that enhances energy efficiency and resolution configurability, enabling practical edge vision applications with high accuracy.
Contribution
It presents a novel digital CIM macro supporting arbitrary operand resolution and shape, with a hybrid dataflow for improved operand reuse and energy savings.
Findings
2× energy efficiency improvement over prior digital CIM-SNNs
Supports bitwise resolution reconfiguration
Achieves 95.8% accuracy on IBM DVS gesture dataset
Abstract
Compute-in-memory (CIM) accelerators for spiking neural networks (SNNs) are promising solutions to enable s-level inference latency and ultra-low energy in edge vision applications. Yet, their current lack of flexibility at both the circuit and system levels prevents their deployment in a wide range of real-life scenarios. In this work, we propose a novel digital CIM macro that supports arbitrary operand resolution and shape, with a unified CIM storage for weights and membrane potentials. These circuit-level techniques enable a hybrid weight- and output-stationary dataflow at the system level to maximize operand reuse, thereby minimizing costly on- and off-chip data movements during the SNN execution. Measurement results of a fabricated FlexSpIM prototype in 40-nm CMOS demonstrate a 2 increase in bit-normalized energy efficiency compared to prior fixed-precision digital…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Parallel Computing and Optimization Techniques · Distributed systems and fault tolerance
MethodsSpiking Neural Networks
