FedBaF: Federated Learning Aggregation Biased by a Foundation Model

Jong-Ik Park; Srinivasa Pranav; Jos\'e M. F. Moura; Carlee Joe-Wong

arXiv:2410.18352·cs.LG·June 24, 2025

FedBaF: Federated Learning Aggregation Biased by a Foundation Model

Jong-Ik Park, Srinivasa Pranav, Jos\'e M. F. Moura, Carlee Joe-Wong

PDF

Open Access

TL;DR

FedBaF is a novel federated learning method that securely integrates foundation models during aggregation, improving accuracy and reducing perplexity without exposing model weights, especially in non-IID and adversarial settings.

Contribution

Introduces FedBaF, a new federated learning approach that preserves foundation model confidentiality while enhancing training performance.

Findings

01

FedBaF outperforms traditional methods by up to 11.4% in IID settings.

02

FedBaF improves accuracy by up to 15.8% in non-IID scenarios.

03

Reduces language model perplexity by up to 39.2%.

Abstract

Foundation models are now a major focus of leading technology organizations due to their ability to generalize across diverse tasks. Existing approaches for adapting foundation models to new applications often rely on Federated Learning (FL) and disclose the foundation model weights to clients when using it to initialize the global model. While these methods ensure client data privacy, they compromise model and information security. In this paper, we introduce Federated Learning Aggregation Biased by a Foundation Model (FedBaF), a novel method for dynamically integrating pre-trained foundation model weights during the FL aggregation phase. Unlike conventional methods, FedBaF preserves the confidentiality of the foundation model while still leveraging its power to train more accurate models, especially in non-IID and adversarial scenarios. Our comprehensive experiments use Pre-ResNet and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security

MethodsAdam · Linear Layer · Dropout · Absolute Position Encodings · Label Smoothing · Byte Pair Encoding · Layer Normalization · Residual Connection · Position-Wise Feed-Forward Layer · Attention Is All You Need