Bayesian Mendelian randomization with study heterogeneity and data partitioning for large studies
Linyi Zou, Hui Guo, and Carlo Berzuini

TL;DR
This paper introduces a Bayesian Mendelian randomization model that accounts for study heterogeneity and employs data partitioning to improve computational efficiency in large-scale studies, maintaining accuracy despite data missingness.
Contribution
It advances Bayesian MR by incorporating random effects for heterogeneity and using subset posterior aggregation to handle large datasets efficiently.
Findings
Random effect Bayesian MR outperforms inverse-variance weighted estimation.
Data partitioning minimally affects causal effect variation but impacts unbiasedness under certain conditions.
Partitioning improves computational efficiency with minimal bias increase.
Abstract
Background: Mendelian randomization (MR) is a useful approach to causal inference from observational studies when randomised controlled trials are not feasible. However, study heterogeneity of two association studies required in MR is often overlooked. When dealing with large studies, recently developed Bayesian MR is limited by its computational expensiveness. Methods: We addressed study heterogeneity by proposing a random effect Bayesian MR model with multiple exposures and outcomes. For large studies, we adopted a subset posterior aggregation method to tackle the problem of computation. In particular, we divided data into subsets and combine estimated subset causal effects obtained from the subsets". The performance of our method was evaluated by a number of simulations, in which part of exposure data was missing. Results: Random effect Bayesian MR outperformed conventional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Genetic Associations and Epidemiology · Gene expression and cancer classification
