MultiTASC++: A Continuously Adaptive Scheduler for Edge-Based Multi-Device Cascade Inference
Sokratis Nikolaidis, Stylianos I. Venieris, Iakovos S. Venieris

TL;DR
MultiTASC++ is a dynamic scheduler designed for multi-device cascade inference in IoT environments, optimizing throughput, accuracy, and latency across diverse devices and workloads.
Contribution
It introduces a novel adaptive scheduler that manages multi-tenancy in multi-device cascade systems, ensuring high accuracy and efficiency in dynamic IoT settings.
Findings
Maintains targeted satisfaction rate across 100 devices.
Achieves high accuracy with low latency.
Demonstrates scalability and adaptability in diverse environments.
Abstract
Cascade systems, consisting of a lightweight model processing all samples and a heavier, high-accuracy model refining challenging samples, have become a widely-adopted distributed inference approach to achieving high accuracy and maintaining a low computational burden for mobile and IoT devices. As intelligent indoor environments, like smart homes, continue to expand, a new scenario emerges, the multi-device cascade. In this setting, multiple diverse devices simultaneously utilize a shared heavy model hosted on a server, often situated within or close to the consumer environment. This work introduces MultiTASC++, a continuously adaptive multi-tenancy-aware scheduler that dynamically controls the forwarding decision functions of devices to optimize system throughput while maintaining high accuracy and low latency. Through extensive experimentation in diverse device environments and with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
