OODIn: An Optimised On-Device Inference Framework for Heterogeneous Mobile Devices
Stylianos I. Venieris, Ioannis Panopoulos, Iakovos S. Venieris

TL;DR
OODIn is a novel framework that optimizes deep learning model deployment on diverse mobile devices by balancing performance and resource constraints, enabling efficient and adaptable inference.
Contribution
The paper introduces a DL-specific software architecture and an analytical framework for multi-objective optimization tailored to heterogeneous mobile devices.
Findings
Up to 4.3x performance improvement over existing methods.
Effective adaptation to dynamic resource changes.
Consistent outperformance across various device types.
Abstract
Radical progress in the field of deep learning (DL) has led to unprecedented accuracy in diverse inference tasks. As such, deploying DL models across mobile platforms is vital to enable the development and broad availability of the next-generation intelligent apps. Nevertheless, the wide and optimised deployment of DL models is currently hindered by the vast system heterogeneity of mobile devices, the varying computational cost of different DL models and the variability of performance needs across DL applications. This paper proposes OODIn, a framework for the optimised deployment of DL apps across heterogeneous mobile devices. OODIn comprises a novel DL-specific software architecture together with an analytical framework for modelling DL applications that: (1) counteract the variability in device resources and DL models by means of a highly parametrised multi-layer design; and (2)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
