Federated mixed effects logistic regression based on one-time shared summary statistics
Marie Analiz April Limpoco, Christel Faes, and Niel Hens

TL;DR
This paper introduces a privacy-preserving federated approach for mixed effects logistic regression that requires only a single round of summary statistic sharing, effectively handling heterogeneity without extensive communication or infrastructure.
Contribution
It proposes a novel method to estimate mixed effects logistic regression using pseudo-data matching summary statistics, reducing communication and security concerns.
Findings
Estimates are comparable to pooled data models in simulations.
Method effectively handles multiple predictors including continuous and categorical variables.
Approach is communication-efficient and avoids complex infrastructure.
Abstract
Upholding data privacy especially in medical research has become tantamount to facing difficulties in accessing individual-level patient data. Estimating mixed effects binary logistic regression models involving data from multiple data providers like hospitals thus becomes more challenging. Federated learning has emerged as an option to preserve the privacy of individual observations while still estimating a global model that can be interpreted on the individual level, but it usually involves iterative communication between the data providers and the data analyst. In this paper, we present a strategy to estimate a mixed effects binary logistic regression model that requires data providers to share summary statistics only once. It involves generating pseudo-data whose summary statistics match those of the actual data and using these into the model estimation process instead of the actual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTechnology and Data Analysis
