Data Sharing in the PRIMED Consortium: Design, implementation, and recommendations for future policymaking
Johanna L. Smith, Quenna Wong, Whitney Hornsby, Matthew P. Conomos,, Benjamin D. Heavner, Iftikhar J. Kullo, Bruce M. Psaty, Stephen S. Rich,, Bamidele Tayo, Pradeep Natarajan, Sarah C. Nelson, Polygenic Risk Methods in

TL;DR
This paper details the design and implementation of data sharing policies within the PRIMED Consortium, addressing challenges of diverse datasets, legal compliance, and technical infrastructure to facilitate genomic research and provide policy recommendations.
Contribution
It introduces a comprehensive data sharing framework for a large genomic consortium, including mechanisms, technical solutions, and policy guidelines for diverse and international datasets.
Findings
Established coordinated data sharing mechanisms including dbGaP and agreements.
Implemented data sharing in the AnVIL cloud platform for secure access.
Proposed solutions for sharing data when individual-level data cannot be openly shared.
Abstract
Sharing diverse genomic and other biomedical datasets is critical to advance scientific discoveries and their equitable translation to improve human health. However, data sharing remains challenging in the context of legacy datasets, evolving policies, multi-institutional consortium science, and international stakeholders. The NIH-funded Polygenic Risk Methods in Diverse Populations (PRIMED) Consortium was established to improve the performance of polygenic risk estimates for a broad range of health and disease outcomes with global impacts. Improving polygenic risk score performance across genetically diverse populations requires access to large, diverse cohorts. We report on the design and implementation of data sharing policies and procedures developed in PRIMED to aggregate and analyze data from multiple, heterogeneous sources while adhering to existing data sharing policies for each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
