A Multi-Armed Bandit-Based Participant Selection Method for Federated Recommendation Systems
Jintao Liu, Mohammad Goudarzi, and Adel Nadjaran Toosi

TL;DR
This paper introduces a Multi-Armed Bandit-based participant selection method for federated recommendation systems, enhancing training efficiency and model quality in heterogeneous edge-cloud environments.
Contribution
It formulates participant selection as a utility optimization problem and proposes a dynamic MAB-based framework that balances exploration and exploitation.
Findings
Outperforms baseline methods across eight data-skew scenarios.
Improves training efficiency by 32-50%.
Enhances model quality metrics such as Recall@50 by up to 5%.
Abstract
Federated Recommendation Systems (FRS) enable privacy-preserving model training by keeping user data on edge devices. However, the practical deployment of FRS in Edge-Cloud environments faces significant challenges due to system and statistical heterogeneity. Existing FRS participant selection strategies struggle to dynamically balance the trade-off between model convergence speed and recommendation quality in such volatile environments. To address this, we formulate the FRS participant selection problem as a normalized utility cost addressing the model quality and system efficiency. Next, we propose a dynamic participant selection framework incorporating a Multi-Armed Bandit (MAB)-based solver for multimodal FRS. We design a client-utility function that jointly evaluates historical Client Performance Reputation, data quality, and real-time system latency. By leveraging an Upper…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
