SiTe CiM: Signed Ternary Computing-in-Memory for Ultra-Low Precision   Deep Neural Networks

Niharika Thakuria; Akul Malhotra; Sandeep K. Thirumala; Reena; Elangovan; Anand Raghunathan; Sumeet K. Gupta

arXiv:2408.13617·cs.AR·August 27, 2024

SiTe CiM: Signed Ternary Computing-in-Memory for Ultra-Low Precision Deep Neural Networks

Niharika Thakuria, Akul Malhotra, Sandeep K. Thirumala, Reena, Elangovan, Anand Raghunathan, Sumeet K. Gupta

PDF

Open Access

TL;DR

This paper introduces SiTe CiM, a compute-in-memory design for signed ternary neural networks that significantly reduces energy and latency, enabling more efficient deep learning hardware.

Contribution

It presents a novel SiTe CiM memory architecture with two variants, achieving substantial energy savings and throughput improvements for ternary DNNs.

Findings

01

Up to 88% lower CiM latency

02

Up to 78% energy savings in CiM operations

03

Up to 7X throughput boost in DNN accelerators

Abstract

Ternary Deep Neural Networks (DNN) have shown a large potential for highly energy-constrained systems by virtue of their low power operation (due to ultra-low precision) with only a mild degradation in accuracy. To enable an energy-efficient hardware substrate for such systems, we propose a compute-enabled memory design, referred to as SiTe-CiM, which features computing-in-memory (CiM) of dot products between signed ternary (SiTe) inputs and weights. SiTe CiM is based on cross-coupling of two bit cells to enable CiM of dot products in the signed ternary regime. We explore SiTe CiM with 8T-SRAM, 3T-embedded DRAM (3T-eDRAM) and 3T-ferroelectric metal FET (FEMFET) memories. We propose two flavors of this technique, namely SiTe CiM I/II. In SiTe CiM I, we employ two additional transistors per cell for cross-coupling, achieving fast CiM operations, albeit incurring an area overhead ranging…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Reservoir Computing