An Interactive Framework for Implementing Privacy-Preserving Federated   Learning: Experiments on Large Language Models

Kasra Ahmadi; Rouzbeh Behnia; Reza Ebrahimi; Mehran Mozaffari Kermani,; Jeremiah Birrell; Jason Pacheco; Attila A Yavuz

arXiv:2502.08008·cs.LG·February 17, 2025

An Interactive Framework for Implementing Privacy-Preserving Federated Learning: Experiments on Large Language Models

Kasra Ahmadi, Rouzbeh Behnia, Reza Ebrahimi, Mehran Mozaffari Kermani,, Jeremiah Birrell, Jason Pacheco, Attila A Yavuz

PDF

Open Access 1 Repo

TL;DR

This paper introduces an interactive framework for privacy-preserving federated learning that balances privacy and utility, especially for resource-limited devices, using a novel fixed-memory differential privacy method and human-in-the-loop decision making.

Contribution

It presents the first framework integrating human privacy practitioners with a scalable fixed-memory differential privacy approach for federated learning on large language models.

Findings

01

Achieved stable memory usage during training.

02

Reduced accuracy loss by approximately 1.33% at ε=10.

03

Reduced accuracy loss by approximately 1.9% at ε=6.

Abstract

Federated learning (FL) enhances privacy by keeping user data on local devices. However, emerging attacks have demonstrated that the updates shared by users during training can reveal significant information about their data. This has greatly thwart the adoption of FL methods for training robust AI models in sensitive applications. Differential Privacy (DP) is considered the gold standard for safeguarding user data. However, DP guarantees are highly conservative, providing worst-case privacy guarantees. This can result in overestimating privacy needs, which may compromise the model's accuracy. Additionally, interpretations of these privacy guarantees have proven to be challenging in different contexts. This is further exacerbated when other factors, such as the number of training iterations, data distribution, and specific application requirements, can add further complexity to this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

KasraAhmadi/FL-Privacy-LLM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data

MethodsADaptive gradient method with the OPTimal convergence rate