Deep Partial Updating: Towards Communication Efficient Updating for   On-device Inference

Zhongnan Qu; Cong Liu; Lothar Thiele

arXiv:2007.03071·cs.LG·July 28, 2022

Deep Partial Updating: Towards Communication Efficient Updating for On-device Inference

Zhongnan Qu, Cong Liu, Lothar Thiele

PDF

TL;DR

This paper introduces a communication-efficient method for updating deep neural networks on edge devices by selectively updating a small subset of weights, maintaining high accuracy with minimal data transfer.

Contribution

It proposes a weight-wise deep partial updating approach that intelligently selects weights to update, reducing communication costs while preserving model performance.

Findings

01

Achieves high inference accuracy with fewer weight updates.

02

Effectively bounds loss difference between partial and full updates.

03

Demonstrates efficiency through extensive experiments.

Abstract

Emerging edge intelligence applications require the server to retrain and update deep neural networks deployed on remote edge nodes to leverage newly collected data samples. Unfortunately, it may be impossible in practice to continuously send fully updated weights to these edge nodes due to the highly constrained communication resource. In this paper, we propose the weight-wise deep partial updating paradigm, which smartly selects a small subset of weights to update in each server-to-edge communication round, while achieving a similar performance compared to full updating. Our method is established through analytically upper-bounding the loss difference between partial updating and full updating, and only updates the weights which make the largest contributions to the upper bound. Extensive experimental results demonstrate the efficacy of our partial updating methodology which achieves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.