Towards Communication-Efficient Model Updating for On-Device   Session-Based Recommendation

Xin Xia; Junliang Yu; Guandong Xu; Hongzhi Yin

arXiv:2308.12777·cs.IR·August 25, 2023

Towards Communication-Efficient Model Updating for On-Device Session-Based Recommendation

Xin Xia, Junliang Yu, Guandong Xu, Hongzhi Yin

PDF

Open Access 1 Repo

TL;DR

This paper introduces a compositional coding method to compress model updates for on-device session-based recommenders, significantly reducing network bandwidth while maintaining accuracy.

Contribution

It proposes a novel compositional coding approach to efficiently compress server-updated models for on-device recommendation systems.

Findings

01

Achieves 60x smaller model updates with comparable accuracy

02

Demonstrates effectiveness across multiple recommendation architectures

03

Reduces network bandwidth for model updates significantly

Abstract

On-device recommender systems recently have garnered increasing attention due to their advantages of providing prompt response and securing privacy. To stay current with evolving user interests, cloud-based recommender systems are periodically updated with new interaction data. However, on-device models struggle to retrain themselves because of limited onboard computing resources. As a solution, we consider the scenario where the model retraining occurs on the server side and then the updated parameters are transferred to edge devices via network communication. While this eliminates the need for local retraining, it incurs a regular transfer of parameters that significantly taxes network bandwidth. To mitigate this issue, we develop an efficient approach based on compositional codes to compress the model update. This approach ensures the on-device model is updated flexibly with minimal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xiaxin1998/odupdate
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Caching and Content Delivery · Advanced Wireless Network Optimization