Best Practices to Design, Plan, and Execute Large-Scale Federated Analyses—Key Learnings and Suggestions from a Study Comprising 52 Databases
Theresa Burkard, Montse Camprubi, Daniel Prieto-Alhambra, Peter Rijnbeek, Marta Pineda-Moncusi

TL;DR
This paper shares best practices for conducting large-scale federated health data studies across 52 databases in 19 countries.
Contribution
The paper provides practical insights and strategies for planning and executing complex, large-scale federated analyses using the OMOP CDM.
Findings
Meticulous planning and strong collaboration are essential for successful federated analyses.
Standardized analytics and inclusive implementation of code improve study outcomes.
Positive feedback from data custodians highlights the value of community engagement and shared learning.
Abstract
Federated network studies allow data to remain locally while the research is conducted through the sharing of analytical code and aggregated results across different health care settings and countries. A large number of databases have been mapped to the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), boosting the use of analytical pipelines for standardized observational research within this open science framework. Transparency, reproducibility, and robustness of results have positioned federated analyses using the OMOP CDM within the European Health Data and Evidence Network (EHDEN) as an essential tool for generating large-scale evidence. We conducted large-scale federated analyses involving 52 databases from 19 countries using the OMOP CDM. In this State-of-the-Art/Best Practice article, we aimed to share key lessons and strategies for conducting such…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHealth Systems, Economic Evaluations, Quality of Life · Health disparities and outcomes · Health, Environment, Cognitive Aging
