Optimising Resource Management for Embedded Machine Learning
Lei Xun, Long Tran-Thanh, Bashir M Al-Hashimi, Geoff V. Merrett

TL;DR
This paper explores online resource management strategies for embedded machine learning on heterogeneous multi-core systems, aiming to optimize performance metrics like speed, energy, and accuracy amidst resource variability.
Contribution
It introduces approaches for dynamic resource management that enable scalable DNN execution, balancing multiple performance metrics on diverse embedded platforms.
Findings
Dynamic scalability of DNNs improves performance trade-offs.
Resource management strategies adapt to platform heterogeneity.
Consistent performance achieved across varying hardware conditions.
Abstract
Machine learning inference is increasingly being executed locally on mobile and embedded platforms, due to the clear advantages in latency, privacy and connectivity. In this paper, we present approaches for online resource management in heterogeneous multi-core systems and show how they can be applied to optimise the performance of machine learning workloads. Performance can be defined using platform-dependent (e.g. speed, energy) and platform-independent (accuracy, confidence) metrics. In particular, we show how a Deep Neural Network (DNN) can be dynamically scalable to trade-off these various performance metrics. Achieving consistent performance when executing on different platforms is necessary yet challenging, due to the different resources provided and their capability, and their time-varying availability when executing alongside other workloads. Managing the interface between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Parallel Computing and Optimization Techniques · Advanced Memory and Neural Computing
