Federated Statistical Analysis: Non-parametric Testing and Quantile Estimation
Ori Becher, Mira Marcus-Kalish, David M. Steinberg

TL;DR
This paper investigates the impact of federated analysis on nonparametric testing and quantile estimation, demonstrating only modest efficiency loss while proposing a privacy-preserving data sharing policy.
Contribution
It analyzes the effects of federated analysis on basic statistical tasks and introduces a K-anonymity based privacy policy for data sharing.
Findings
Modest loss in statistical efficiency for nonparametric tests and quantile estimation
Proposed a K-anonymity privacy policy adopted by European research platform
Federated analysis can be effective with limited efficiency trade-offs
Abstract
The age of big data has fueled expectations for accelerating learning. The availability of large data sets enables researchers to achieve more powerful statistical analyses and enhances the reliability of conclusions, which can be based on a broad collection of subjects. Often such data sets can be assembled only with access to diverse sources; for example, medical research that combines data from multiple centers in a federated analysis. However these hopes must be balanced against data privacy concerns, which hinder sharing raw data among centers. Consequently, federated analyses typically resort to sharing data summaries from each center. The limitation to summaries carries the risk that it will impair the efficiency of statistical analysis procedures. In this work we take a close look at the effects of federated analysis on two very basic problems, nonparametric comparison of two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Advanced Causal Inference Techniques · Statistical Methods and Inference
