CARIn: Constraint-Aware and Responsive Inference on Heterogeneous Devices for Single- and Multi-DNN Workloads
Ioannis Panopoulos, Stylianos I. Venieris, Iakovos S. Venieris

TL;DR
CARIn is a framework that optimizes the deployment of single- and multi-DNN workloads on heterogeneous devices, enabling real-time, privacy-preserving, and resource-efficient inference with dynamic adaptation.
Contribution
It introduces a multi-objective optimization framework with a runtime-aware search algorithm for efficient, adaptive deployment of DNNs on mobile devices under various constraints.
Findings
Achieves up to 1.92x better fairness compared to single-model designs.
Outperforms state-of-the-art OODIn framework by up to 10.69x.
Gains up to 4.06x in multi-DNN applications over hardware-unaware designs.
Abstract
The relentless expansion of deep learning applications in recent years has prompted a pivotal shift toward on-device execution, driven by the urgent need for real-time processing, heightened privacy concerns, and reduced latency across diverse domains. This article addresses the challenges inherent in optimising the execution of deep neural networks (DNNs) on mobile devices, with a focus on device heterogeneity, multi-DNN execution, and dynamic runtime adaptation. We introduce CARIn, a novel framework designed for the optimised deployment of both single- and multi-DNN applications under user-defined service-level objectives. Leveraging an expressive multi-objective optimisation framework and a runtime-aware sorting and search algorithm (RASS) as the MOO solver, CARIn facilitates efficient adaptation to dynamic conditions while addressing resource contention issues associated with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training · Focus
