Neural Collapse Meets Differential Privacy: Curious Behaviors of NoisyGD   with Near-perfect Representation Learning

Chendi Wang; Yuqing Zhu; Weijie J. Su; Yu-Xiang Wang

arXiv:2405.08920·cs.LG·October 15, 2024

Neural Collapse Meets Differential Privacy: Curious Behaviors of NoisyGD with Near-perfect Representation Learning

Chendi Wang, Yuqing Zhu, Weijie J. Su, Yu-Xiang Wang

PDF

Open Access

TL;DR

This paper explores the intersection of Neural Collapse and differential privacy, providing theoretical insights and empirical evidence on feature quality, robustness issues, and strategies like PCA to improve private fine-tuning in deep learning.

Contribution

It offers a theoretical error bound related to Neural Collapse, analyzes the robustness of DP fine-tuning, and proposes practical methods like PCA to enhance privacy-preserving learning.

Findings

01

Neural Collapse leads to dimension-independent misclassification error under certain conditions.

02

Transformers produce better feature representations within the Neural Collapse framework.

03

PCA significantly improves testing accuracy in differentially private fine-tuning.

Abstract

A recent study by De et al. (2022) has reported that large-scale representation learning through pre-training on a public dataset significantly enhances differentially private (DP) learning in downstream tasks, despite the high dimensionality of the feature space. To theoretically explain this phenomenon, we consider the setting of a layer-peeled model in representation learning, which results in interesting phenomena related to learned features in deep learning and transfer learning, known as Neural Collapse (NC). Within the framework of NC, we establish an error bound indicating that the misclassification error is independent of dimension when the distance between actual features and the ideal ones is smaller than a threshold. Additionally, the quality of the features in the last layer is empirically evaluated under different pre-trained models within the framework of NC, showing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace Recognition and Perception · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning

MethodsPrincipal Components Analysis