Deep Neural Network Approximation for Custom Hardware: Where We've Been, Where We're Going
Erwei Wang, James J. Davis, Ruizhe Zhao, Ho-Cheung Ng, Xinyu Niu,, Wayne Luk, Peter Y. K. Cheung, George A. Constantinides

TL;DR
This paper surveys approximation methods for neural networks tailored for custom hardware, highlighting their effectiveness and future research directions to improve efficiency and deployment feasibility.
Contribution
It provides the first comprehensive comparison of hardware accelerators with approximation techniques for convolutional and recurrent neural networks.
Findings
Custom hardware accelerators outperform general-purpose processors in throughput and energy efficiency.
Approximation methods enable deployment of smaller, sparse, and hardware-efficient neural networks.
The survey offers detailed evaluations and future research proposals in the field.
Abstract
Deep neural networks have proven to be particularly effective in visual and audio recognition tasks. Existing models tend to be computationally expensive and memory intensive, however, and so methods for hardware-oriented approximation have become a hot topic. Research has shown that custom hardware-based neural network accelerators can surpass their general-purpose processor equivalents in terms of both throughput and energy efficiency. Application-tailored accelerators, when co-designed with approximation-based network training methods, transform large, dense and computationally expensive networks into small, sparse and hardware-efficient alternatives, increasing the feasibility of network deployment. In this article, we provide a comprehensive evaluation of approximation methods for high-performance network inference along with in-depth discussion of their effectiveness for custom…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
