Towards Communication-Efficient Model Updating for On-Device Session-Based Recommendation
Xin Xia, Junliang Yu, Guandong Xu, Hongzhi Yin

TL;DR
This paper introduces a compositional coding method to compress model updates for on-device session-based recommenders, significantly reducing network bandwidth while maintaining accuracy.
Contribution
It proposes a novel compositional coding approach to efficiently compress server-updated models for on-device recommendation systems.
Findings
Achieves 60x smaller model updates with comparable accuracy
Demonstrates effectiveness across multiple recommendation architectures
Reduces network bandwidth for model updates significantly
Abstract
On-device recommender systems recently have garnered increasing attention due to their advantages of providing prompt response and securing privacy. To stay current with evolving user interests, cloud-based recommender systems are periodically updated with new interaction data. However, on-device models struggle to retrain themselves because of limited onboard computing resources. As a solution, we consider the scenario where the model retraining occurs on the server side and then the updated parameters are transferred to edge devices via network communication. While this eliminates the need for local retraining, it incurs a regular transfer of parameters that significantly taxes network bandwidth. To mitigate this issue, we develop an efficient approach based on compositional codes to compress the model update. This approach ensures the on-device model is updated flexibly with minimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Caching and Content Delivery · Advanced Wireless Network Optimization
