Selective Task offloading for Maximum Inference Accuracy and Energy   efficient Real-Time IoT Sensing Systems

Abdelkarim Ben Sada; Amar Khelloufi; Abdenacer Naouri; Huansheng Ning; and Sahraoui Dhelim

arXiv:2402.16904·cs.LG·March 3, 2024·1 cites

Selective Task offloading for Maximum Inference Accuracy and Energy efficient Real-Time IoT Sensing Systems

Abdelkarim Ben Sada, Amar Khelloufi, Abdenacer Naouri, Huansheng Ning, and Sahraoui Dhelim

PDF

Open Access

TL;DR

This paper presents a hybrid genetic algorithm for optimally allocating inference models in IoT edge systems, maximizing accuracy while respecting time and energy limits, and significantly improves speed and accuracy over existing methods.

Contribution

It introduces LGSTO, a lightweight hybrid genetic algorithm with novel termination and neighborhood techniques for efficient model allocation in resource-constrained IoT systems.

Findings

01

LGSTO is 3 times faster than comparable schemes.

02

LGSTO achieves higher average inference accuracy.

03

The problem is formulated as a strongly NP-hard multidimensional knapsack problem.

Abstract

The recent advancements in small-size inference models facilitated AI deployment on the edge. However, the limited resource nature of edge devices poses new challenges especially for real-time applications. Deploying multiple inference models (or a single tunable model) varying in size and therefore accuracy and power consumption, in addition to an edge server inference model, can offer a dynamic system in which the allocation of inference models to inference jobs is performed according to the current resource conditions. Therefore, in this work, we tackle the problem of selectively allocating inference models to jobs or offloading them to the edge server to maximize inference accuracy under time and energy constraints. This problem is shown to be an instance of the unbounded multidimensional knapsack problem which is considered a strongly NP-hard problem. We propose a lightweight…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIoT and Edge/Fog Computing