Multi-Site Health Research Integrating Complementary Data Sources: A Scoping Review of Statistical Inference Methods for Vertically Partitioned Data
Marie-Pier Domingue, Simon L\'evesque, Anita Burgun, Jean-Fran\c{c}ois Ethier, F\'elix Camirand Lemyre

TL;DR
This scoping review examines existing statistical inference methods for vertically partitioned health data, highlighting their limitations in achieving equivalence, efficiency, and privacy in multi-institutional health research.
Contribution
It systematically characterizes current vertical methods in health research, revealing gaps in their ability to provide accurate, efficient, and privacy-preserving analysis results.
Findings
Most methods focus on linear and logistic regression.
Equivalence to pooled analysis is rarely systematically addressed.
Many methods require multiple communication rounds and lack comprehensive privacy assessments.
Abstract
To address the multidimensional nature of health-related questions, advances in health research often require integrating information from various data sources within statistical analyses. When complementary information pertaining to the same set of individuals are distributed across different institutions, vertical methods make it possible to obtain analysis results without sharing or pooling individual-level data. To guide stakeholders toward a transparent use of vertical methods, this study aims to (1) Identify existing vertical methods enabling statistical inference; and (2) Characterize the methodological properties of these methods and the current extent of their use with health data. We conducted a scoping review using four interdisciplinary databases. We then systematically extracted the characteristics of identified vertical methods with respect to comparability with the pooled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
