Resource-Efficient Neural Networks for Embedded Systems
Wolfgang Roth, G\"unther Schindler, Bernhard Klein, Robert Peharz,, Sebastian Tschiatschek, Holger Fr\"oning, Franz Pernkopf, Zoubin Ghahramani

TL;DR
This paper reviews techniques for making deep neural networks more resource-efficient for embedded systems, focusing on quantization, pruning, and structural optimization to balance performance with limited computational and energy resources.
Contribution
It provides a comprehensive overview of current methods for resource-efficient DNN inference, including experimental validation on embedded hardware platforms.
Findings
Quantization and pruning significantly reduce model size and energy consumption.
Trade-offs between resource efficiency and prediction accuracy are challenging.
Different hardware architectures have varying compatibility with efficiency techniques.
Abstract
While machine learning is traditionally a resource intensive task, embedded systems, autonomous navigation, and the vision of the Internet of Things fuel the interest in resource-efficient approaches. These approaches aim for a carefully chosen trade-off between performance and resource consumption in terms of computation and energy. The development of such approaches is among the major challenges in current machine learning research and key to ensure a smooth transition of machine learning technology from a scientific environment with virtually unlimited computing resources into everyday's applications. In this article, we provide an overview of the current state of the art of machine learning techniques facilitating these real-world requirements. In particular, we focus on resource-efficient inference based on deep neural networks (DNNs), the predominant machine learning models of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
