Mitigating Noise Detriment in Differentially Private Federated Learning with Model Pre-training

Huitong Jin; Yipeng Zhou; Quan Z. Sheng; Shiting Wen; Laizhong Cui

arXiv:2408.09478·cs.LG·October 10, 2025

Open Access

TL;DR

This paper introduces Pretrain-DPFL, a framework for fine-tuning pre-trained models in Differentially Private Federated Learning (DPFL). It systematically compares fine-tuning strategies and backs the resulting accuracy gains with theoretical analysis and experiments.

Contribution

It proposes a systematic evaluation of fine-tuning strategies in DPFL, with theoretical convergence analysis and extensive experiments demonstrating improved privacy-utility trade-offs.

Findings

01. Pretrain-DPFL achieves 25.22% higher accuracy than scratch training.

02. Unified-tuning outperforms other strategies in mitigating noise.

03. Theoretical conditions identify optimal fine-tuning strategies.

Abstract

Differentially Private Federated Learning (DPFL) strengthens privacy protection by perturbing model gradients with noise, though at the cost of reduced accuracy. Although prior empirical studies indicate that initializing from pre-trained rather than random parameters can alleviate noise disturbance, the problem of optimally fine-tuning pre-trained models in DPFL remains unaddressed. In this paper, we propose Pretrain-DPFL, a framework that systematically evaluates the three most representative fine-tuning strategies: full-tuning (FT), head-tuning (HT), and unified-tuning (UT), which applies HT followed by FT. Through convergence analysis under smooth non-convex loss, we establish theoretical conditions for identifying the optimal fine-tuning strategy in Pretrain-DPFL, thereby maximizing the benefits of pre-trained models in mitigating noise disturbance. Extensive experiments across multiple…
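To make the three strategies named in the abstract concrete, here is a minimal Python sketch. It is an illustration, not the paper's implementation, and it rests on several assumptions: a single-client toy update rather than a full federated round, a model split into two hypothetical parameter groups called `backbone` and `head`, Gaussian noise with standard deviation `sigma * clip_norm` added after L2 clipping, and a hypothetical `switch_round` at which UT moves from HT to FT. None of these names or values come from the paper.

```python
import math
import random


def clip_and_perturb(grad, clip_norm, sigma, rng):
    """Clip a gradient to L2 norm <= clip_norm, then add Gaussian noise.

    The noise scale sigma * clip_norm is an assumption of this sketch.
    """
    norm = math.sqrt(sum(g * g for g in grad))
    scale = min(1.0, clip_norm / norm) if norm > 0 else 1.0
    return [g * scale + rng.gauss(0.0, sigma * clip_norm) for g in grad]


def trainable_parts(strategy, round_idx, switch_round):
    """Return the parameter groups updated in a given round.

    FT: backbone and head. HT: head only.
    UT: HT before switch_round (hypothetical parameter), FT from then on.
    """
    if strategy == "FT":
        return ("backbone", "head")
    if strategy == "HT":
        return ("head",)
    if strategy == "UT":
        return ("head",) if round_idx < switch_round else ("backbone", "head")
    raise ValueError(f"unknown strategy: {strategy}")


def dp_round(params, grads, strategy, round_idx, switch_round,
             lr, clip_norm, sigma, rng):
    """One noisy gradient step on the selected groups (single-client toy)."""
    names = trainable_parts(strategy, round_idx, switch_round)
    # Clip and perturb the trainable gradient jointly; frozen groups get no noise.
    flat = [g for name in names for g in grads[name]]
    noisy = clip_and_perturb(flat, clip_norm, sigma, rng)
    updated = {name: list(values) for name, values in params.items()}
    offset = 0
    for name in names:
        size = len(params[name])
        step = noisy[offset:offset + size]
        updated[name] = [p - lr * g for p, g in zip(params[name], step)]
        offset += size
    return updated


if __name__ == "__main__":
    rng = random.Random(0)
    params = {"backbone": [1.0, 1.0], "head": [1.0]}
    grads = {"backbone": [3.0, 0.0], "head": [4.0]}
    for t in range(4):
        params = dp_round(params, grads, "UT", t, switch_round=2,
                          lr=0.1, clip_norm=1.0, sigma=0.5, rng=rng)
        print(t, trainable_parts("UT", t, 2), params)
```

In this sketch HT injects noise into fewer coordinates than FT because the frozen backbone is never perturbed, which gives some intuition for why the choice of strategy interacts with the noise level.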

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques