Federated generalized linear mixed models based on one-time shared summary statistics
Marie Analiz April Limpoco, Christel Faes, and Niel Hens

TL;DR
This paper introduces a privacy-preserving method for estimating generalized linear mixed models using only a single round of shared summary statistics, matching the accuracy of traditional methods.
Contribution
It proposes a novel approach that generates pseudo-data matching summary statistics, enabling accurate model estimation without sharing individual data.
Findings
Estimates from pseudo-data match those from actual data up to three decimal places.
The method achieves similar bias, coverage, and prediction performance as traditional approaches.
It significantly reduces communication and resource requirements.
Abstract
Data privacy has increasingly become a daunting challenge because it limits data availability, which is essential in estimating statistical models such as generalized linear mixed models. Access to personal data often involves considerable time, effort, and paperwork, which can impede research progress and collaboration. Existing approaches that do not use individual-level data for model estimation are either prone to ecological bias, cannot handle heterogeneity, or require iterative communication. In this paper, we propose an approach to estimate generalized linear mixed models based on summary statistics shared only once. We used linear, logistic, and Poisson mixed models as examples to demonstrate the methodology. Our strategy involves generating pseudo-data whose summary statistics match those of the actual but unavailable data. These pseudo-data are then used for model estimation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
