HE2C: A Holistic Approach for Allocating Latency-Sensitive AI Tasks across Edge-Cloud
Minseo Kim, Wei Shu, Mohsen Amini Salehi

TL;DR
HE2C is a comprehensive framework that optimizes latency, energy, accuracy, and throughput for deep learning tasks across edge and cloud resources, improving performance and battery life in resource-constrained environments.
Contribution
HE2C introduces a holistic approach with modules for feasibility checking, resource allocation, and approximate computing to optimize multiple metrics simultaneously.
Findings
Significantly increases task throughput within deadlines.
Preserves battery life and accuracy on edge devices.
Reduces latency impact through approximate computing.
Abstract
The high computational, memory, and energy demands of Deep Learning (DL) applications often exceed the capabilities of battery-powered edge devices, creating difficulties in meeting task deadlines and accuracy requirements. Unlike previous solutions that optimize a single metric (e.g., accuracy or energy efficiency), HE2C framework is designed to holistically address the latency, memory, accuracy, throughput, and energy demands of DL applications across edge-cloud continuum, thereby, delivering a more comprehensive and effective user experience. HE2C comprises three key modules: (a) a "feasibility-check module that evaluates the likelihood of meeting deadlines across both edge and cloud resources; (b) a "resource allocation strategy" that maximizes energy efficiency without sacrificing the inference accuracy; and (c) a "rescue module" that enhances throughput by leveraging approximate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Robotics and Automated Systems
