Hardware Approximate Techniques for Deep Neural Network Accelerators: A Survey

Giorgos Armeniakos; Georgios Zervakis; Dimitrios Soudris; Jörg Henkel

arXiv:2203.08737 · cs.AR · March 18, 2022

TL;DR

This survey reviews hardware approximation techniques for DNN accelerators, analyzing their types, evaluation complexity, and potential to improve energy efficiency, reliability, and security in neural network inference.

Contribution

It provides a comprehensive classification and analysis of hardware approximation methods for DNNs, including evaluation metrics and future research directions.

Findings

01

Identified key approximation families and their characteristics.

02

Assessed evaluation complexity and efficiency of approximate DNN accelerators.

03

Discussed error metrics, accuracy recovery, and broader benefits like security.

Abstract

Deep Neural Networks (DNNs) are very popular because of their high performance in various cognitive tasks in Machine Learning (ML). Recent advancements in DNNs have achieved beyond-human accuracy in many tasks, but at the cost of high computational complexity. To enable efficient execution of DNN inference, a growing number of research works therefore exploit the inherent error resilience of DNNs and employ Approximate Computing (AC) principles to address the elevated energy demands of DNN accelerators. This article provides a comprehensive survey and analysis of hardware approximation techniques for DNN accelerators. First, we analyze the state of the art and, by identifying approximation families, cluster the respective works by approximation type. Next, we analyze the complexity of the performed evaluations (with respect to the dataset and DNN size) to assess the…
