TL;DR
This paper introduces FELARE, a lightweight, fair scheduling heuristic for heterogeneous edge systems that optimizes energy and latency for machine learning tasks, improving task completion and energy efficiency.
Contribution
The paper proposes a novel fair, multi-objective heuristic for resource allocation in energy-limited, heterogeneous edge systems, outperforming existing methods.
Findings
8.9% increase in on-time task completion rate
12.6% energy savings
Effective at low to moderate request rates
Abstract
Edge computing enables smart IoT-based systems via concurrent and continuous execution of latency-sensitive machine learning (ML) applications. These edge-based machine learning systems are often battery-powered (i.e., energy-limited). They use heterogeneous resources with diverse computing performance (e.g., CPU, GPU, and/or FPGAs) to fulfill the latency constraints of ML applications. The challenge is to allocate user requests for different ML applications on the Heterogeneous Edge Computing Systems (HEC) with respect to both the energy and latency constraints of these systems. To this end, we study and analyze resource allocation solutions that can increase the on-time task completion rate while considering the energy constraint. Importantly, we investigate edge-friendly (lightweight) multi-objective mapping heuristics that do not become biased toward a particular application type to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
