ATRIA: A Bit-Parallel Stochastic Arithmetic Based Accelerator for In-DRAM CNN Processing
Supreeth Mysore Shivanandamurthy, Ishan. G. Thakkar, Sayed Ahmad, Salehi

TL;DR
ATRIA is a novel in-DRAM CNN accelerator that uses bit-parallel stochastic arithmetic to significantly enhance inference speed and efficiency with minimal accuracy loss, outperforming previous in-DRAM solutions.
Contribution
Introduces ATRIA, a new in-DRAM accelerator employing light-weight modifications for bit-parallel stochastic arithmetic, achieving high-speed CNN inference with improved efficiency.
Findings
Up to 3.2x increase in frames-per-second (FPS)
Up to 10x improvement in efficiency (FPS/W/mm2)
Only 3.5% accuracy drop in CNN inference
Abstract
With the rapidly growing use of Convolutional Neural Networks (CNNs) in real-world applications related to machine learning and Artificial Intelligence (AI), several hardware accelerator designs for CNN inference and training have been proposed recently. In this paper, we present ATRIA, a novel bit-pArallel sTochastic aRithmetic based In-DRAM Accelerator for energy-efficient and high-speed inference of CNNs. ATRIA employs light-weight modifications in DRAM cell arrays to implement bit-parallel stochastic arithmetic based acceleration of multiply-accumulate (MAC) operations inside DRAM. ATRIA significantly improves the latency, throughput, and efficiency of processing CNN inferences by performing 16 MAC operations in only five consecutive memory operation cycles. We mapped the inference tasks of four benchmark CNNs on ATRIA to compare its performance with five state-of-the-art in-DRAM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
