Addressing overlapping sample challenges in genome-wide association studies: Meta-reductive approach
Farid Rajabli, Azra Emekci

TL;DR
This paper introduces a new method to improve the accuracy of genetic risk scores by adjusting for overlapping datasets in genetic studies.
Contribution
The novel Meta-Reductive Approach (MRA) recalibrates GWAS summary statistics to neutralize overlapping sample influences.
Findings
MRA recalibrates summary statistics using algebraic derivations to match individual-level data results.
Validation on Alzheimer's datasets showed MRA summary statistics matched individual-level data exactly.
MRA enhances PRS accuracy when using meta-analyzed GWAS data.
Abstract
Polygenic risk scores (PRS) are instrumental in genetics, offering insights into an individual level genetic risk to a range of diseases based on accumulated genetic variations. These scores rely on Genome-Wide Association Studies (GWAS). However, precision in PRS is often challenged by the requirement of extensive sample sizes and the potential for overlapping datasets that can inflate PRS calculations. In this study, we present a novel methodology, Meta-Reductive Approach (MRA), that was derived algebraically to adjust GWAS results, aiming to neutralize the influence of select cohorts. Our approach recalibrates summary statistics using algebraic derivations. Validating our technique with datasets from Alzheimer disease studies, we showed that the summary statistics of the MRA and those derived from individual-level data yielded the exact same values. This innovative method offers a…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Bioinformatics and Genomic Networks · Genetic Mapping and Diversity in Plants and Animals
