A Differentially Private Probabilistic Framework for Modeling the Variability Across Federated Datasets of Heterogeneous Multi-View Observations
Irene Balelli, Santiago Silva, Marco Lorenzi

TL;DR
This paper introduces a hierarchical Bayesian federated learning framework with differential privacy guarantees for modeling data variability across heterogeneous multi-view datasets, demonstrated on medical imaging data for Alzheimer's disease.
Contribution
It presents a novel probabilistic federated learning approach combining hierarchical Bayesian modeling with differential privacy, optimized via EM, for multi-view heterogeneous data.
Findings
Robust performance on iid and non-iid data distributions
Effective quantification of data and view variability
High-quality data reconstruction comparable to state-of-the-art methods
Abstract
We propose a novel federated learning paradigm to model data variability among heterogeneous clients in multi-centric studies. Our method is expressed through a hierarchical Bayesian latent variable model, where client-specific parameters are assumed to be realization from a global distribution at the master level, which is in turn estimated to account for data bias and variability across clients. We show that our framework can be effectively optimized through expectation maximization (EM) over latent master's distribution and clients' parameters. We also introduce formal differential privacy (DP) guarantees compatibly with our EM optimization scheme. We tested our method on the analysis of multi-modal medical imaging data and clinical scores from distributed clinical datasets of patients affected by Alzheimer's disease. We demonstrate that our method is robust when data is distributed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMRI in cancer diagnosis · Privacy-Preserving Technologies in Data · Dementia and Cognitive Impairment Research
