TL;DR
This paper explores the challenges and solutions for deploying deep learning models on mobile devices, focusing on balancing accuracy, complexity, and efficiency to optimize performance in resource-constrained environments.
Contribution
It provides a practical study of deployment challenges, documents experiences, and proposes solutions for optimizing deep learning model performance on mobile platforms.
Findings
Identified key challenges like framework availability and data dependency.
Implemented a solution to improve model sustainability and reduce challenges.
Discussed the impact of complexity on accuracy and performance trade-offs.
Abstract
When building Deep Learning (DL) models, data scientists and software engineers manage the trade-off between their accuracy, or any other suitable success criteria, and their complexity. In an environment with high computational power, a common practice is making the models go deeper by designing more sophisticated architectures. However, in the context of mobile devices, which possess less computational power, keeping complexity under control is a must. In this paper, we study the performance of a system that integrates a DL model as a trade-off between the accuracy and the complexity. At the same time, we relate the complexity to the efficiency of the system. With this, we present a practical study that aims to explore the challenges met when optimizing the performance of DL models becomes a requirement. Concretely, we aim to identify: (i) the most concerning challenges when deploying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
