FedDRL: A Trustworthy Federated Learning Model Fusion Method Based on   Staged Reinforcement Learning

Leiming Chen; Weishan Zhang; Cihao Dong; Sibo Qiao; Ziling Huang,; Yuming Nie; Zhaoxiang Hou; Chee Wei Tan

arXiv:2307.13716·cs.LG·March 20, 2024

FedDRL: A Trustworthy Federated Learning Model Fusion Method Based on Staged Reinforcement Learning

Leiming Chen, Weishan Zhang, Cihao Dong, Sibo Qiao, Ziling Huang,, Yuming Nie, Zhaoxiang Hou, Chee Wei Tan

PDF

Open Access

TL;DR

FedDRL introduces a staged reinforcement learning approach to improve federated learning by filtering malicious models and adaptively weighting trusted models, enhancing reliability without sacrificing accuracy.

Contribution

The paper presents a novel two-stage reinforcement learning method for model fusion in federated learning, addressing data heterogeneity and malicious model threats.

Findings

01

FedDRL outperforms baseline algorithms in reliability across five scenarios.

02

The method effectively filters malicious models and adaptively weights trusted models.

03

Maintains high accuracy comparable to traditional federated learning methods.

Abstract

Traditional federated learning uses the number of samples to calculate the weights of each client model and uses this fixed weight value to fusion the global model. However, in practical scenarios, each client's device and data heterogeneity leads to differences in the quality of each client's model. Thus the contribution to the global model is not wholly determined by the sample size. In addition, if clients intentionally upload low-quality or malicious models, using these models for aggregation will lead to a severe decrease in global model accuracy. Traditional federated learning algorithms do not address these issues. To solve this probelm, we propose FedDRL, a model fusion approach using reinforcement learning based on a two staged approach. In the first stage, Our method could filter out malicious models and selects trusted client models to participate in the model fusion. In the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Access Control and Trust