Deep Partial Updating: Towards Communication Efficient Updating for On-device Inference
Zhongnan Qu, Cong Liu, Lothar Thiele

TL;DR
This paper introduces a communication-efficient method for updating deep neural networks on edge devices by selectively updating a small subset of weights, maintaining high accuracy with minimal data transfer.
Contribution
It proposes a weight-wise deep partial updating approach that intelligently selects weights to update, reducing communication costs while preserving model performance.
Findings
Achieves high inference accuracy with fewer weight updates.
Effectively bounds loss difference between partial and full updates.
Demonstrates efficiency through extensive experiments.
Abstract
Emerging edge intelligence applications require the server to retrain and update deep neural networks deployed on remote edge nodes to leverage newly collected data samples. Unfortunately, it may be impossible in practice to continuously send fully updated weights to these edge nodes due to the highly constrained communication resource. In this paper, we propose the weight-wise deep partial updating paradigm, which smartly selects a small subset of weights to update in each server-to-edge communication round, while achieving a similar performance compared to full updating. Our method is established through analytically upper-bounding the loss difference between partial updating and full updating, and only updates the weights which make the largest contributions to the upper bound. Extensive experimental results demonstrate the efficacy of our partial updating methodology which achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
