FLASH-RL: Federated Learning Addressing System and Static Heterogeneity using Reinforcement Learning
Sofiane Bouaziz, Hadjer Benmeziane, Youcef Imine, Leila Hamdad, Smail, Niar, Hamza Ouarnoughi

TL;DR
FLASH-RL is a novel federated learning framework that uses reinforcement learning to optimize client selection, effectively balancing system heterogeneity, reducing latency, and accelerating training on multiple datasets.
Contribution
It introduces a reinforcement learning-based client selection method with a reputation utility function, addressing system and static heterogeneity in federated learning.
Findings
Reduces latency by up to 24.83% compared to FedAVG.
Cuts training rounds by up to 60.44% over FedAVG.
Improves model performance and training speed in fall detection.
Abstract
Federated Learning (FL) has emerged as a promising Machine Learning paradigm, enabling multiple users to collaboratively train a shared model while preserving their local data. To minimize computing and communication costs associated with parameter transfer, it is common practice in FL to select a subset of clients in each training round. This selection must consider both system and static heterogeneity. Therefore, we propose FLASH-RL, a framework that utilizes Double Deep QLearning (DDQL) to address both system and static heterogeneity in FL. FLASH-RL introduces a new reputation-based utility function to evaluate client contributions based on their current and past performances. Additionally, an adapted DDQL algorithm is proposed to expedite the learning process. Experimental results on MNIST and CIFAR-10 datasets have shown FLASH-RL's effectiveness in achieving a balanced trade-off…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
MethodsDouble Deep Q-Learning
