Privacy-Preserving and Lossless Distributed Estimation of High-Dimensional Generalized Additive Mixed Models
Daniel Schalk, Bernd Bischl, David R\"ugamer

TL;DR
This paper introduces a novel distributed, privacy-preserving, and lossless algorithm for estimating high-dimensional generalized additive mixed models using component-wise gradient boosting, enabling flexible modeling while respecting data privacy.
Contribution
The paper presents a new distributed algorithm for GAMM estimation that is privacy-preserving, lossless, and maintains the properties of the original CWB method, including high-dimensional feature handling.
Findings
Algorithm achieves equivalent results to pooled data estimation.
Effective in modeling heterogeneous data across sites.
Demonstrated on a heart disease dataset with competitive performance.
Abstract
Various privacy-preserving frameworks that respect the individual's privacy in the analysis of data have been developed in recent years. However, available model classes such as simple statistics or generalized linear models lack the flexibility required for a good approximation of the underlying data-generating process in practice. In this paper, we propose an algorithm for a distributed, privacy-preserving, and lossless estimation of generalized additive mixed models (GAMM) using component-wise gradient boosting (CWB). Making use of CWB allows us to reframe the GAMM estimation as a distributed fitting of base learners using the -loss. In order to account for the heterogeneity of different data location sites, we propose a distributed version of a row-wise tensor product that allows the computation of site-specific (smooth) effects. Our adaption of CWB preserves all the important…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Tensor decomposition and applications · Machine Learning in Healthcare
MethodsBalanced Selection · Feature Selection
