TiM-DNN: Ternary in-Memory accelerator for Deep Neural Networks

Shubham Jain; Sumeet Kumar Gupta; Anand Raghunathan

arXiv:1909.06892·cs.LG·May 6, 2020

TiM-DNN: Ternary in-Memory accelerator for Deep Neural Networks

Shubham Jain, Sumeet Kumar Gupta, Anand Raghunathan

PDF

TL;DR

TiM-DNN is a programmable in-memory accelerator optimized for ternary deep neural networks, achieving significant improvements in energy efficiency and performance over GPUs and specialized accelerators.

Contribution

This paper introduces TiM-DNN, a novel in-memory accelerator specifically designed for ternary DNNs, supporting multiple ternary representations and demonstrating superior efficiency.

Findings

01

Achieves 114 TOPs/s peak performance with 0.9W power consumption.

02

Outperforms NVIDIA Tesla V100 GPU by 300X in TOPS/W.

03

Outperforms specialized DNN accelerators by up to 240X in TOPS/W.

Abstract

The use of lower precision has emerged as a popular technique to optimize the compute and storage requirements of complex Deep Neural Networks (DNNs). In the quest for lower precision, recent studies have shown that ternary DNNs (which represent weights and activations by signed ternary values) represent a promising sweet spot, achieving accuracy close to full-precision networks on complex tasks. We propose TiM-DNN, a programmable in-memory accelerator that is specifically designed to execute ternary DNNs. TiM-DNN supports various ternary representations including unweighted {-1,0,1}, symmetric weighted {-a,0,a}, and asymmetric weighted {-a,0,b} ternary systems. The building blocks of TiM-DNN are TiM tiles -- specialized memory arrays that perform massively parallel signed ternary vector-matrix multiplications with a single access. TiM tiles are in turn composed of Ternary Processing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.