Revisiting Label Inference Attacks in Vertical Federated Learning: Why They Are Vulnerable and How to Defend
Yige Liu, Dexuan Xu, Zimai Guo, Yongzhi Cao, Hanpin Wang

TL;DR
This paper analyzes label inference attacks in vertical federated learning, revealing their vulnerabilities and proposing a layer adjustment defense that enhances privacy without overhead.
Contribution
It introduces the model compensation phenomenon, proves mutual information increases with layer depth, and proposes a zero-overhead layer shifting defense in VFL.
Findings
Disrupting feature-label distribution alignment reduces LIA success.
Layer shifting improves resistance to label inference attacks.
The proposed method enhances privacy across multiple datasets and models.
Abstract
Vertical federated learning (VFL) allows an active party with a top model, and multiple passive parties with bottom models to collaborate. In this scenario, passive parties possessing only features may attempt to infer active party's private labels, making label inference attacks (LIAs) a significant threat. Previous LIA studies have claimed that well-trained bottom models can effectively represent labels. However, we demonstrate that this view is misleading and exposes the vulnerability of existing LIAs. By leveraging mutual information, we present the first observation of the "model compensation" phenomenon in VFL. We theoretically prove that, in VFL, the mutual information between layer outputs and labels increases with layer depth, indicating that bottom models primarily extract feature information while the top model handles label mapping. Building on this insight, we introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning
