StoX-Net: Stochastic Processing of Partial Sums for Efficient In-Memory Computing DNN Accelerators
Ethan G Rogers, Sohan Salahuddin Mugdho, Kshemal Kshemendra Gupte, and, Cheng Wang

TL;DR
This paper introduces StoX-Net, a stochastic processing approach for in-memory DNN accelerators that significantly reduces energy, latency, and area overhead by eliminating ADCs through probabilistic partial sum processing.
Contribution
It proposes a novel stochastic partial sum processing method using spin-orbit torque magnetic tunnel junctions, along with training techniques to maintain accuracy, enabling efficient in-memory computing without ADCs.
Findings
Achieves up to 16x energy reduction
Reduces latency by 8x
Improves area efficiency by 10x
Abstract
Crossbar-based in-memory computing (IMC) has emerged as a promising platform for hardware acceleration of deep neural networks (DNNs). However, the energy and latency of IMC systems are dominated by the large overhead of the peripheral analog-to-digital converters (ADCs). To address such ADC bottleneck, here we propose to implement stochastic processing of array-level partial sums (PS) for efficient IMC. Leveraging the probabilistic switching of spin-orbit torque magnetic tunnel junctions, the proposed PS processing eliminates the costly ADC, achieving significant improvement in energy and area efficiency. To mitigate accuracy loss, we develop PS-quantization-aware training that enables backward propagation across stochastic PS. Furthermore, a novel scheme with an inhomogeneous sampling length of the stochastic conversion is proposed. When running ResNet20 on the CIFAR-10 dataset, our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural Networks and Applications · Stochastic Gradient Optimization Techniques
