Optimising Resource Management for Embedded Machine Learning

Lei Xun; Long Tran-Thanh; Bashir M Al-Hashimi; Geoff V. Merrett

arXiv:2105.03608·cs.CV·May 11, 2021

Optimising Resource Management for Embedded Machine Learning

Lei Xun, Long Tran-Thanh, Bashir M Al-Hashimi, Geoff V. Merrett

PDF

Open Access 1 Repo

TL;DR

This paper explores online resource management strategies for embedded machine learning on heterogeneous multi-core systems, aiming to optimize performance metrics like speed, energy, and accuracy amidst resource variability.

Contribution

It introduces approaches for dynamic resource management that enable scalable DNN execution, balancing multiple performance metrics on diverse embedded platforms.

Findings

01

Dynamic scalability of DNNs improves performance trade-offs.

02

Resource management strategies adapt to platform heterogeneity.

03

Consistent performance achieved across varying hardware conditions.

Abstract

Machine learning inference is increasingly being executed locally on mobile and embedded platforms, due to the clear advantages in latency, privacy and connectivity. In this paper, we present approaches for online resource management in heterogeneous multi-core systems and show how they can be applied to optimise the performance of machine learning workloads. Performance can be defined using platform-dependent (e.g. speed, energy) and platform-independent (accuracy, confidence) metrics. In particular, we show how a Deep Neural Network (DNN) can be dynamically scalable to trade-off these various performance metrics. Achieving consistent performance when executing on different platforms is necessary yet challenging, due to the different resources provided and their capability, and their time-varying availability when executing alongside other workloads. Managing the interface between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

UoS-EEC/DynamicOFA
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Parallel Computing and Optimization Techniques · Advanced Memory and Neural Computing