Xcel-RAM: Accelerating Binary Neural Networks in High-Throughput SRAM Compute Arrays
Amogh Agrawal, Akhilesh Jaiswal, Deboleena Roy, Bing Han,, Gopalakrishnan Srinivasan, Aayush Ankit, Kaushik Roy

TL;DR
This paper introduces Xcel-RAM, a novel SRAM-based architecture that accelerates binary neural network computations by integrating binary convolutions directly within SRAM arrays, significantly improving energy efficiency and throughput.
Contribution
It proposes a sectioned SRAM design with charge sharing XNOR and popcount operations, enabling efficient in-memory binary neural network acceleration.
Findings
Energy consumption reduced by 2.5x
Performance improved by 4x
Achieved 1.914pJ energy per operation and 45ns delay
Abstract
Deep neural networks are a biologically-inspired class of algorithms that have recently demonstrated state-of-the-art accuracies involving large-scale classification and recognition tasks. Indeed, a major landmark that enables efficient hardware accelerators for deep networks is the recent advances from the machine learning community that have demonstrated aggressively scaled deep binary networks with state-of-the-art accuracies. In this paper, we demonstrate how deep binary networks can be accelerated in modified von-Neumann machines by enabling binary convolutions within the SRAM array. In general, binary convolutions consist of bit-wise XNOR followed by a population-count (popcount). We present a charge sharing XNOR and popcount operation in 10 transistor SRAM cells. We have employed multiple circuit techniques including dual-read-worldines (Dual-RWL) along with a dual-stage ADC that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
