Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead
Maurizio Capra, Beatrice Bussolino, Alberto Marchisio, Guido Masera,, Maurizio Martina, Muhammad Shafique

TL;DR
This survey reviews current hardware and software strategies for accelerating deep neural networks, emphasizing energy efficiency, security issues, and benchmarking across platforms like CPU, GPU, FPGA, and ASIC.
Contribution
It provides a comprehensive comparison of state-of-the-art hardware solutions and discusses security and benchmarking challenges in deploying deep learning models.
Findings
FPGAs and ASICs offer greater energy efficiency and design flexibility.
Security vulnerabilities are significant during DNN and SNN execution.
Benchmarking methods are essential for evaluating hardware and network performance.
Abstract
Currently, Machine Learning (ML) is becoming ubiquitous in everyday life. Deep Learning (DL) is already present in many applications ranging from computer vision for medicine to autonomous driving of modern cars as well as other sectors in security, healthcare, and finance. However, to achieve impressive performance, these algorithms employ very deep networks, requiring a significant computational power, both during the training and inference time. A single inference of a DL model may require billions of multiply-and-accumulated operations, making the DL extremely compute- and energy-hungry. In a scenario where several sophisticated algorithms need to be executed with limited energy and low latency, the need for cost-effective hardware platforms capable of implementing energy-efficient DL execution arises. This paper first introduces the key properties of two brain-inspired models like…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
