FedCDC: A Collaborative Framework for Data Consumers in Federated   Learning Market

Zhuan Shi; Patrick Ohl; Boi Faltings

arXiv:2502.19109·cs.DC·February 27, 2025

FedCDC: A Collaborative Framework for Data Consumers in Federated Learning Market

Zhuan Shi, Patrick Ohl, Boi Faltings

PDF

Open Access

TL;DR

FedCDC introduces a collaborative framework in federated learning markets that detects shared subtasks among Data Consumers, enabling joint training and ensemble distillation to improve model accuracy despite data access restrictions.

Contribution

This work presents a novel framework, FedCDC, for collaborative recruitment and training among Data Consumers with similar tasks in federated learning markets, enhancing performance under data access constraints.

Findings

01

Joint training mitigates performance loss due to data access restrictions.

02

Ensemble distillation improves accuracy of Data Consumers' models.

03

Experimental results show significant accuracy gains with FedCDC.

Abstract

Federated learning (FL) allows machine learning models to be trained on distributed datasets without directly accessing local data. In FL markets, numerous Data Consumers compete to recruit Data Owners for their respective training tasks, but budget constraints and competition can prevent them from securing sufficient data. While existing solutions focus on optimizing one-to-one matching between Data Owners and Data Consumers, we propose \methodname{}, a novel framework that facilitates collaborative recruitment and training for Data Consumers with similar tasks. Specifically, \methodname{} detects shared subtasks among multiple Data Consumers and coordinates the joint training of submodels specialized for these subtasks. Then, through ensemble distillation, these submodels are integrated into each Data Consumer global model. Experimental evaluations on three benchmark datasets…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Ethics and Social Impacts of AI · Data Quality and Management