Modeling the Resource Requirements of Convolutional Neural Networks on Mobile Devices
Zongqing Lu, Swati Rallapalli, Kevin Chan, Thomas La Porta

TL;DR
This paper investigates the resource needs of CNNs on mobile devices by measuring, modeling, and developing a tool to estimate compute time and resource usage, facilitating efficient deployment of deep learning models on mobile platforms.
Contribution
It introduces Augur, a modeling tool that predicts CNN resource requirements on mobile devices, addressing measurement overhead and platform variability.
Findings
Measured CNN performance on mobile CPUs and GPUs
Developed a model to estimate compute time and resource usage
Created Augur, a tool for resource prediction and optimization
Abstract
Convolutional Neural Networks (CNNs) have revolutionized the research in computer vision, due to their ability to capture complex patterns, resulting in high inference accuracies. However, the increasingly complex nature of these neural networks means that they are particularly suited for server computers with powerful GPUs. We envision that deep learning applications will be eventually and widely deployed on mobile devices, e.g., smartphones, self-driving cars, and drones. Therefore, in this paper, we aim to understand the resource requirements (time, memory) of CNNs on mobile devices. First, by deploying several popular CNNs on mobile CPUs and GPUs, we measure and analyze the performance and resource usage for every layer of the CNNs. Our findings point out the potential ways of optimizing the performance on mobile devices. Second, we model the resource requirements of the different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
