FedDRL: A Trustworthy Federated Learning Model Fusion Method Based on Staged Reinforcement Learning
Leiming Chen, Weishan Zhang, Cihao Dong, Sibo Qiao, Ziling Huang,, Yuming Nie, Zhaoxiang Hou, Chee Wei Tan

TL;DR
FedDRL introduces a staged reinforcement learning approach to improve federated learning by filtering malicious models and adaptively weighting trusted models, enhancing reliability without sacrificing accuracy.
Contribution
The paper presents a novel two-stage reinforcement learning method for model fusion in federated learning, addressing data heterogeneity and malicious model threats.
Findings
FedDRL outperforms baseline algorithms in reliability across five scenarios.
The method effectively filters malicious models and adaptively weights trusted models.
Maintains high accuracy comparable to traditional federated learning methods.
Abstract
Traditional federated learning uses the number of samples to calculate the weights of each client model and uses this fixed weight value to fusion the global model. However, in practical scenarios, each client's device and data heterogeneity leads to differences in the quality of each client's model. Thus the contribution to the global model is not wholly determined by the sample size. In addition, if clients intentionally upload low-quality or malicious models, using these models for aggregation will lead to a severe decrease in global model accuracy. Traditional federated learning algorithms do not address these issues. To solve this probelm, we propose FedDRL, a model fusion approach using reinforcement learning based on a two staged approach. In the first stage, Our method could filter out malicious models and selects trusted client models to participate in the model fusion. In the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Access Control and Trust
