An Interactive Framework for Implementing Privacy-Preserving Federated Learning: Experiments on Large Language Models
Kasra Ahmadi, Rouzbeh Behnia, Reza Ebrahimi, Mehran Mozaffari Kermani,, Jeremiah Birrell, Jason Pacheco, Attila A Yavuz

TL;DR
This paper introduces an interactive framework for privacy-preserving federated learning that balances privacy and utility, especially for resource-limited devices, using a novel fixed-memory differential privacy method and human-in-the-loop decision making.
Contribution
It presents the first framework integrating human privacy practitioners with a scalable fixed-memory differential privacy approach for federated learning on large language models.
Findings
Achieved stable memory usage during training.
Reduced accuracy loss by approximately 1.33% at ε=10.
Reduced accuracy loss by approximately 1.9% at ε=6.
Abstract
Federated learning (FL) enhances privacy by keeping user data on local devices. However, emerging attacks have demonstrated that the updates shared by users during training can reveal significant information about their data. This has greatly thwart the adoption of FL methods for training robust AI models in sensitive applications. Differential Privacy (DP) is considered the gold standard for safeguarding user data. However, DP guarantees are highly conservative, providing worst-case privacy guarantees. This can result in overestimating privacy needs, which may compromise the model's accuracy. Additionally, interpretations of these privacy guarantees have proven to be challenging in different contexts. This is further exacerbated when other factors, such as the number of training iterations, data distribution, and specific application requirements, can add further complexity to this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
MethodsADaptive gradient method with the OPTimal convergence rate
