Resource-Efficient Neural Networks for Embedded Systems

Wolfgang Roth; G\"unther Schindler; Bernhard Klein; Robert Peharz,; Sebastian Tschiatschek; Holger Fr\"oning; Franz Pernkopf; Zoubin Ghahramani

arXiv:2001.03048·stat.ML·April 9, 2024·20 cites

Resource-Efficient Neural Networks for Embedded Systems

Wolfgang Roth, G\"unther Schindler, Bernhard Klein, Robert Peharz,, Sebastian Tschiatschek, Holger Fr\"oning, Franz Pernkopf, Zoubin Ghahramani

PDF

Open Access

TL;DR

This paper reviews techniques for making deep neural networks more resource-efficient for embedded systems, focusing on quantization, pruning, and structural optimization to balance performance with limited computational and energy resources.

Contribution

It provides a comprehensive overview of current methods for resource-efficient DNN inference, including experimental validation on embedded hardware platforms.

Findings

01

Quantization and pruning significantly reduce model size and energy consumption.

02

Trade-offs between resource efficiency and prediction accuracy are challenging.

03

Different hardware architectures have varying compatibility with efficiency techniques.

Abstract

While machine learning is traditionally a resource intensive task, embedded systems, autonomous navigation, and the vision of the Internet of Things fuel the interest in resource-efficient approaches. These approaches aim for a carefully chosen trade-off between performance and resource consumption in terms of computation and energy. The development of such approaches is among the major challenges in current machine learning research and key to ensure a smooth transition of machine learning technology from a scientific environment with virtually unlimited computing resources into everyday's applications. In this article, we provide an overview of the current state of the art of machine learning techniques facilitating these real-world requirements. In particular, we focus on resource-efficient inference based on deep neural networks (DNNs), the predominant machine learning models of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification