Responsive DNN Adaptation for Video Analytics against Environment Shift via Hierarchical Mobile-Cloud Collaborations
Maozhe Zhao, Shengzhong Liu, Fan Wu, Guihai Chen

TL;DR
This paper introduces MOCHA, a hierarchical mobile-cloud framework that enhances responsiveness and efficiency in adapting deep neural networks for mobile video analytics amid environment shifts.
Contribution
MOCHA innovatively combines on-device model reuse, structured taxonomy for model retrieval, and proactive cache management to improve adaptation responsiveness.
Findings
Up to 6.8% accuracy improvement during adaptation
Response delay reduced by up to 35.5 times
Retraining time decreased by up to 3 times
Abstract
Mobile video analysis systems often encounter various deploying environments, where environment shifts present greater demands for responsiveness in adaptations of deployed "expert DNN models". Existing model adaptation frameworks primarily operate in a cloud-centric way, exhibiting degraded performance during adaptation and delayed reactions to environment shifts. Instead, this paper proposes MOCHA, a novel framework optimizing the responsiveness of continuous model adaptation through hierarchical collaborations between mobile and cloud resources. Specifically, MOCHA (1) reduces adaptation response delays by performing on-device model reuse and fast fine-tuning before requesting cloud model retrieval and end-to-end retraining; (2) accelerates history expert model retrieval by organizing them into a structured taxonomy utilizing domain semantics analyzed by a cloud foundation model as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Advanced Computing and Algorithms · COVID-19 diagnosis using AI
