Fast and accurate imputation of summary statistics enhances evidence of functional enrichment
Bogdan Pasaniuc, Noah Zaitlen, Huwenbo Shi, Gaurav Bhatia, Alexander, Gusev, Joseph Pickrell, Joel Hirschhorn, David P Strachan, Nick Patterson,, Alkes L. Price

TL;DR
This paper introduces a fast, accurate Gaussian imputation method for summary statistics in GWAS, improving power and enabling functional enrichment analysis without needing individual-level data.
Contribution
The authors develop a novel Gaussian imputation approach for summary statistics that accounts for reference panel size, enhancing accuracy and computational efficiency over existing methods.
Findings
Recovers 84-87% of effective sample size for variants
Achieves 95-105% of effective sample size in empirical data
Increases evidence of functional enrichment at genic loci
Abstract
Imputation using external reference panels is a widely used approach for increasing power in GWAS and meta-analysis. Existing HMM-based imputation approaches require individual-level genotypes. Here, we develop a new method for Gaussian imputation from summary association statistics, a type of data that is becoming widely available. In simulations using 1000 Genomes (1000G) data, this method recovers 84% (54%) of the effective sample size for common (>5%) and low-frequency (1-5%) variants (increasing to 87% (60%) when summary LD information is available from target samples) versus 89% (67%) for HMM-based imputation, which cannot be applied to summary statistics. Our approach accounts for the limited sample size of the reference panel, a crucial step to eliminate false-positive associations, and is computationally very fast. As an empirical demonstration, we apply our method to 7…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
