FedCFA: Alleviating Simpson's Paradox in Model Aggregation with   Counterfactual Federated Learning

Zhonghua Jiang; Jimin Xu; Shengyu Zhang; Tao Shen; Jiwei Li; Kun; Kuang; Haibin Cai; Fei Wu

arXiv:2412.18904·cs.LG·December 30, 2024

FedCFA: Alleviating Simpson's Paradox in Model Aggregation with Counterfactual Federated Learning

Zhonghua Jiang, Jimin Xu, Shengyu Zhang, Tao Shen, Jiwei Li, Kun, Kuang, Haibin Cai, Fei Wu

PDF

Open Access 1 Repo 1 Video

TL;DR

FedCFA introduces a counterfactual learning framework in federated learning to address Simpson's Paradox caused by data heterogeneity, improving global model accuracy and efficiency.

Contribution

The paper proposes FedCFA, a novel federated learning approach using counterfactual samples and factor decorrelation to mitigate Simpson's Paradox effects in non-IID data.

Findings

01

Outperforms existing FL methods in accuracy and efficiency

02

Effectively mitigates Simpson's Paradox in heterogeneous data scenarios

03

Achieves superior results on six benchmark datasets

Abstract

Federated learning (FL) is a promising technology for data privacy and distributed optimization, but it suffers from data imbalance and heterogeneity among clients. Existing FL methods try to solve the problems by aligning client with server model or by correcting client model with control variables. These methods excel on IID and general Non-IID data but perform mediocrely in Simpson's Paradox scenarios. Simpson's Paradox refers to the phenomenon that the trend observed on the global dataset disappears or reverses on a subset, which may lead to the fact that global model obtained through aggregation in FL does not accurately reflect the distribution of global data. Thus, we propose FedCFA, a novel FL framework employing counterfactual learning to generate counterfactual samples by replacing local data critical factors with global average data, aligning local data distributions with the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hua-zi/FedCFA
pytorch

Videos

FedCFA: Alleviating Simpson's Paradox in Model Aggregation with Counterfactual Federated Learning· underline

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting