DISCA: A Digital In-memory Stochastic Computing Architecture Using A Compressed Bent-Pyramid Format
Shady Agwa, Yikang Shen, Shiwei Wang, Themis Prodromakis

TL;DR
DISCA introduces a digital in-memory stochastic computing architecture using a compressed Bent-Pyramid format, achieving high energy efficiency for matrix multiplication in AI applications at the edge.
Contribution
It presents a novel digital in-memory stochastic computing architecture that combines the simplicity of analog with the scalability of digital systems, using a new data format.
Findings
Energy efficiency of 3.59 TOPS/W per bit at 500 MHz in 180 nm CMOS.
Significant energy efficiency improvements over existing architectures.
Scalable and reliable digital in-memory computing for AI workloads.
Abstract
Nowadays, we are witnessing an Artificial Intelligence revolution that dominates the technology landscape in various application domains, such as healthcare, robotics, automotive, security, and defense. Massive-scale AI models, which mimic the human brain's functionality, typically feature millions and even billions of parameters through data-intensive matrix multiplication tasks. While conventional Von-Neumann architectures struggle with the memory wall and the end of Moore's Law, these AI applications are migrating rapidly towards the edge, such as in robotics and unmanned aerial vehicles for surveillance, thereby adding more constraints to the hardware budget of AI architectures at the edge. Although in-memory computing has been proposed as a promising solution for the memory wall, both analog and digital in-memory computing architectures suffer from substantial degradation of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
