Distributed Deep Variational Approach for Privacy-preserving Data Release

Zahir Alsulaimawi; Huaping Liu

arXiv:2605.03069·cs.CR·May 6, 2026

Distributed Deep Variational Approach for Privacy-preserving Data Release

Zahir Alsulaimawi, Huaping Liu

PDF

TL;DR

This paper introduces GPP, a privacy-preserving data release framework that learns to sanitize high-dimensional data representations, balancing utility and privacy in federated learning settings.

Contribution

It extends variational privacy methods to federated learning, enabling local data sanitization with minimal utility loss and strong privacy guarantees.

Findings

01

GPP achieves utility close to baseline autoencoders.

02

GPP reduces adversary's AUC to near random guessing.

03

Effective across multiple benchmark datasets.

Abstract

Federated learning (FL) lets distributed nodes train a shared model without exchanging their raw data, but in privacy-sensitive deployments medical sensors, IoT devices, wearables the protection offered by keeping data local is incomplete: gradients, model updates, and the released representations themselves can leak sensitive attributes. We propose the \emph{Gaussian Privacy Protector} (GPP), a data-release framework for continuous, high-dimensional inputs that learns a stochastic encoder mapping raw data to a low-dimensional sanitized representation. The encoder is trained against a variational lower bound on the mutual information between the released representation and a designated sensitive attribute, while a separate cross-entropy term preserves a designated utility attribute, with a Lagrange multiplier $β$ controlling the trade-off. We then extend GPP to the federated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.