Data Privatization in Vertical Federated Learning with Client-wise Missing Problem
Huiyun Tang, Long Feng, Yang Li, Feifei Wang

TL;DR
This paper introduces a privacy-preserving framework for vertical federated learning with missing data, using Gaussian copulas and novel privatization mechanisms to ensure data privacy without sacrificing analysis utility.
Contribution
It proposes a copula-based data privatization framework for VFL with missing data, including new mechanisms and theoretical privacy-utility guarantees, applicable to mixed data types.
Findings
Effective privatization of VFL data with missingness.
Theoretical privacy and utility guarantees established.
Demonstrated success on simulations and real data.
Abstract
Vertical Federated Learning (VFL) often suffers from client-wise missingness, where entire feature blocks from some clients are unobserved, and conventional approaches are vulnerable to privacy leakage. We propose a Gaussian copulabased framework for VFL data privatization under missingness constraints, which requires no prior specification of downstream analysis tasks and imposes no restriction on the number of analyses. To privately estimate copula parameters, we introduce a debiased randomized response mechanism for correlation matrix estimation from perturbed ranks, together with a nonparametric privatized marginal estimation that yields consistent CDFs even under MAR. The proposed methods comprise VCDS for MCAR data, EVCDS for MAR data, and IEVCDS, which iteratively refines copula parameters to mitigate MAR-induced bias. Notably, EVCDS and IEVCDS also apply under MCAR, and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Random Matrices and Applications
