AIDA: Associative DNN Inference Accelerator

Leonid Yavits; Roman Kaplan; Ran Ginosar

arXiv:1901.04976·cs.DC·January 16, 2019·1 cites

TL;DR

AIDA is an associative in-memory inference engine that accelerates the fully-connected layers of DNNs by processing data in-situ, inside the memory arrays. Its area and energy efficiency improve with sparsity and lower arithmetic precision.

Contribution

The paper introduces AIDA, an associative in-memory processor for DNN inference that the authors report outperforms the EIE inference accelerator by 14.5x in peak performance and 2.5x in throughput.

Findings

01

AIDA achieves 14.5x higher peak performance than EIE.

02

AIDA achieves 2.5x higher throughput than EIE.

03

AIDA's area and energy efficiency benefit strongly from sparsity and low-precision arithmetic.

Abstract

We propose AIDA, an inference engine for accelerating the fully-connected (FC) layers of Deep Neural Networks (DNNs). AIDA is an associative in-memory processor, where the bulk of the data never leaves the confines of the memory arrays, and processing is performed in-situ. AIDA's area and energy efficiency strongly benefit from sparsity and lower arithmetic precision. We show that AIDA outperforms the state-of-the-art inference accelerator, EIE, by 14.5x (peak performance) and 2.5x (throughput).
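
To make the role of sparsity and low precision concrete, the following is a minimal software sketch of a fully-connected layer that stores only nonzero, low-bit integer weights and skips zero activations. It is not AIDA's associative in-memory algorithm, which the abstract does not detail. The function names (`quantize`, `to_sparse`, `fc_sparse`), the 4-bit width, and the row-wise sparse layout are all assumptions chosen for illustration.

```python
from typing import List, Tuple

# Hypothetical sparse layout: one list per output neuron holding
# (input_index, quantized_weight) pairs for the nonzero weights only.
SparseRow = List[Tuple[int, int]]


def quantize(weights: List[List[float]], bits: int = 4) -> Tuple[List[List[int]], float]:
    """Symmetric uniform quantization to signed `bits`-bit integers (illustrative)."""
    qmax = 2 ** (bits - 1) - 1
    max_abs = max((abs(w) for row in weights for w in row), default=0.0)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    q = [[round(w / scale) for w in row] for row in weights]
    return q, scale


def to_sparse(q: List[List[int]]) -> List[SparseRow]:
    """Drop zero weights so that they cost neither storage nor computation."""
    return [[(j, w) for j, w in enumerate(row) if w != 0] for row in q]


def fc_sparse(rows: List[SparseRow], x: List[int], scale: float) -> Tuple[List[float], int]:
    """Compute y = W x over nonzero weights and nonzero activations only.

    Returns the outputs and the number of multiply-accumulates performed.
    """
    out: List[float] = []
    macs = 0
    for row in rows:
        acc = 0  # integer accumulator: low-precision operands, cheap arithmetic
        for j, w in row:
            if x[j] != 0:  # skip zero activations
                acc += w * x[j]
                macs += 1
        out.append(acc * scale)  # rescale back to real-valued units
    return out, macs


weights = [[0.0, 7.0, 0.0],
           [-3.0, 0.0, 2.0]]
x = [1, 0, 4]

q, scale = quantize(weights, bits=4)
y, macs = fc_sparse(to_sparse(q), x, scale)
dense_macs = len(weights) * len(x)
print(y, macs, dense_macs)
```

In this toy case a dense layer would perform 6 multiply-accumulates, while the sparse version performs 2, which illustrates why an accelerator for FC layers can gain from sparsity; the smaller integer operands are likewise cheaper to store and process than full-precision values.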

Taxonomy

Topics: Advanced Memory and Neural Computing · Advanced Neural Network Applications · Ferroelectric and Negative Capacitance Devices