Methods for Prioritizing Causal Genes in Molecular Studies of Human Disease: The State of the Art

Karina Patasova; Bahar Sedaghati‐Khayat; Rachel Knevel; Heather J. Cordell; Arthur G. Pratt

PMC · DOI:10.1002/gepi.70037·March 2, 2026

Methods for Prioritizing Causal Genes in Molecular Studies of Human Disease: The State of the Art

Karina Patasova, Bahar Sedaghati‐Khayat, Rachel Knevel, Heather J. Cordell, Arthur G. Pratt

PDF

Open Access

TL;DR

This paper reviews methods for identifying genes that cause human diseases using genetic and molecular data.

Contribution

The paper provides an updated overview of causal gene inference methods and their interplay in molecular studies.

Findings

01

Colocalization helps identify shared genetic signals between traits.

02

Mendelian randomization improves causal inference by reducing confounding.

03

Network-based approaches model complex gene relationships but face limitations like pleiotropy.

Abstract

In the last decade, genome‐wide association studies (GWAS) have identified tens of thousands of common variants associated with a wide array of complex traits and diseases. Integration of GWAS with molecular data has informed the development of statistical tools for causal gene discovery. In this paper, we give an overview of commonly used causal inference methods and discuss the strengths and limitations of colocalization, Mendelian randomization (MR) and network‐based approaches. Colocalization is often used to assess whether the genetic association signals for two traits arise from the same causal variant, thereby strengthening inferred causal associations. MR was developed to tackle issues of confounding and reverse causality, providing a rigorous approach to causal inference and demonstrating improved false discovery rates. Unlike MR, network‐based analyses employ a discovery…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Figures4

Click any figure to enlarge with its caption.

Five different colocalization scenarios. Panel A shows a scenario where a trait of interest and a QTL involve two distinct causal variants in a linkage disequilibrium. Panels B and C display colocalization. In panel B, a trait and QTL are independent but share a single common causal variant. Panel C shows a situation where a trait and QTL also share a single common causal variant, but the trait influences the QTL. Panel D and E display the situation where the trait and QTL have a shared causal variant in conjunction with distinct causal variants. Traditional colocalization methods cannot differentiate between situations where the trait and QTL are unrelated (B and D) and those where either the trait or QTL influence each other. Figure adapted from Zuber V et al. “Combining evidence from Mendelian randomization and colocalization: Review and comparison of approaches” (Zuber et al. 2022). The figure was produced in Biorender.

Interplay between different causal inference methods. Figure was produced in Biorender.

Overview of principles and core assumptions of Mendelian Randomization. Figure was produced in Biorender.

Schematic representation of network‐based approaches for causal inference. Genetic variants can be used as anchors in causal networks to help orient the edges (dashed lines) between different nodes, here represented by different risk factors. Figure was produced in Biorender.

Tables2

Table 1. Colocalization methods.

Method	Description	Strengths	Limitations
Colocalization Tests of Two Genetic Traits (coloc) (Giambartolomei et al. 2014); Hypothesis Prioritization for multi‐trait Colocalization (HyPrColoc) (Foley et al. 2021)	Leverages the Bayesian framework to estimate posterior probability of trait and eQTL sharing a causal SNP within the same locus. Only infers colocalization when there is compelling evidence supporting it.	Computationally efficient and can be applied across multiple loci; uses a Bayesian framework that allows prior specification, generating interpretable probabilities rather than binary decisions; can be extended to multi‐trait colocalization, using HyPrColoc.	Assumes one causal variant per trait per region, does not directly model LD; requires overlapping SNPs between datasets; sensitive to prior specifications.
Sum of Single Effects (SuSiE); coloc. susie (Wallace 2021; Zou et al. 2022)	Conducts fine‐mapping by breaking down genetic associations into sum of single effects processed in sparse model fitted for each trait separately. Separates statistical evidence of association for each variant while conditioning on causal signal at the locus.	Improves resolution for loci with allelic heterogeneity; LD structure is incorporated in the model; aids prioritization by quantifying probability of SNP being causal.	Assumes additive and linear effects; misestimation of number variants in credible sets or inaccurate LD matrix can lead to misleading results.
Expression Quantitative Trait Loci and GWAS Colocalization Analysis via Integrated Association Rates eCAVIAR (Hormozdiari et al. 2016)	Specifically designed to detect colocalization between GWAS and eQTL data by calculating colocalization posterior probabilities.	Models multiple causal variants at the locus by enumerating different causal variant configurations and computing joint probabilities; can detect loci where causal variants are different in GWAS and eQTL data; provides a confidence interval for colocalization of GWAS variants; functional annotations can be used to improve prior specification and fine‐mapping; incorporates LD matrix to account for the correlated structure between SNPs.	Computationally intensive; requires accurate ancestry‐matched LD reference panel.
Colocalization and Fine‐mapping in the presence of Allelic Heterogeneity (CAFEH) (Arvanitis et al. 2022a)	Implements a hierarchical Bayesian model to perform fine‐mapping and colocalization across multiple traits.	Detects shared and distinct causal variants across multiple traits, tissues and cell types, thereby increasing statistical power; models allelic heterogeneity and is able to identify multiple causal variants at the locus.	Computationally intensive due to Bayesian inference procedure; requires accurate ancestry‐matched LD reference panel; assumes equal prior probability for all variants at the locus; unable to detect the presence of multiple causal association signals at the locus that has variants in high LD; doesn't allow missing values in effect size matrix.
Shared sparse Projection for colocalization analysis (SharePro)	Implements effect group‐level approach for colocalization	Handles allelic heterogeneity and LD structure by grouping correlated variants and assessing colocalization at effect group level.	High prior colocalization probabilities increase rates of false positives; requires ancestry‐matched LD reference panel.
Colocalization Quantitative Trait Loci Analysis (ColocQuiaL) (Chen et al. 2022)	Platform that simplifies and streamlines colocalization analysis at scale.	Automates colocalization analyses; accepts a variety of summary statistics formats.	Assumes a single causal variant at the locus; doesn't perform multi‐trait colocalization.
Easy Quantitative Trait Loci (ezQTL) (Zhang et al. 2022)	Platform that simplifies and streamlines colocalization analysis at scale.	Hosts GWAS and QTL public datasets; performs data quality control, LD visualization and paired colocalization analyses; allows multi‐locus query.	Uses a multi‐trait statistical colocalization analyses approach that assumes a single shared causal variant at the locus.

Table 2. Mendelian randomization methods.

Method	Description	Strengths	Limitations
Wald Ratio test (Burgess et al. 2017)	Provides a causal estimate for a single IV by dividing the beta coefficient of SNP‐outcome association by the beta coefficient of SNP‐exposure association.	Easy to implement and interpret.	Sensitive to pleiotropy and weak instrument bias.
Inverse variance weighted (IVW) (Burgess et al. 2013)	Combines Wald ratio estimates in a fixed effects meta‐analysis, weighting each ratio by the inverse of its variance. Assumes no horizontal pleiotropy and is sensitive to weak instrument bias.	Increased statistical power.	Sensitive to SNP heterogeneity pleiotropy and instruments invalidity.
Weighted Median (Bowden et al. 2016b)	Estimates the causal effect as the median of the weighted distribution of ratio estimates. Requires that at least 50% of the weight comes from valid instruments.	Less sensitive to outliers with pleiotropic effects than IVW.	Requires a large number of IVs and assumes that 50% are valid.
MR‐Egger (Bowden et al. 2015)	Aggregates Wald‐ratio estimates into meta‐regression, while adjusting for directional pleiotropy. Intercept term is used to test for pleiotropy under InSIDE assumption.	Detects and handles pleiotropy by including an intercept term in the regression model	Less statistically powered than IVW test, sensitive to measurement error.
Summary data‐based Mendelian randomization (SMR) (Zhu et al. 2016)	Aggregates summary data from independent GWAS and eQTL studies to ascertain genes whose expression levels show associations with a trait due to pleiotropy.	Handles large molecular data and identifies genes whose expression levels influence trait.	Can't distinguish between vertical and horizontal pleiotropy and make conclusions about causal associations; is limited to gene expression and doesn't include other mechanisms such as post‐translational modifications; relies on the quality and resolution of GWAS.
Mendelian randomization with linkage disequilibrium and pleiotropy (MR‐link) (van der Graaf et al. 2020)	Employs eQTL summary statistics and individual level data for exposure and outcome, correcting for LD and unobserved pleiotropy without removing pleiotropic IVs.	Explicitly models LD and unobserved pleiotropy without needing to remove pleiotropic variants; derives robust estimates even with small number of IVs.	Requires individual level data; computationally intensive due matrix and LD modelling.
Effective‐Median‐based Mendelian randomization (EMIC) (Jiang et al. 2022)	Identifies false‐positives due to LD and correlation between IVs, using multiple cis‐eQTLs for pleiotropy fine‐mapping. Employs eigenvalue decomposition matrix estimated from ancestry matched reference panel.	Explicitly models LD between IVs and mitigates pleiotropy by using a median‐based estimator without needing to exclude pleiotropic variants.	Requires ancestry‐matched reference panel; computationally intensive due to eigenvalue decomposition of LD matrix; relies on the quality and resolution of GWAS.
Mendelian randomization with correlated horizontal pleiotropy (MR‐Corr²) ⁽Cheng et al. 2022a ⁾	Designed to tackle correlated horizontal pleiotropy where genetic variants influence both exposure and outcome variables through shared biologic pathways	Handles correlated horizontal pleiotropy by incorporating a correlation parameter in MR model; allows to use correlated IVs rather than requiring LD pruning by building LD matrix.	Performs under assumption that pleiotropic effects are either entirely correlated or uncorrelated; assumes that only a small proportion of IVs have pleiotropic effects; doesn't account for sample overlap. between exposure and outcome datasets; computationally intensive due to requiring correlation matrices; relies on the quality and resolution of GWAS.
Causal Analysis Using Summary Effect estimates (CAUSE) (Morrison et al. 2020)	Explicitly models both correlated and uncorrelated pleiotropic effects, mitigating reverse causation and weak instrument bias.	Model unobserved pleiotropy and doesn't require prior information about shared heritable factors	LD pruning may exclude informative variants in regions with high LD; doesn't allow to include known shared factors (e.g., measured confounders or mediators) in the model; relies on the quality and resolution of GWAS.
Mendelian randomization with correlated horizontal pleiotropy unraveling shared etiology and confounding (MR‐CUE) (Cheng et al. 2022b)	Considers both shared and IV‐specific correlated horizontal pleiotropy effects. Maps IVs with correlated horizontal pleiotropy to cis‐associated genes and enriched pathways.	Correlated horizontal pleiotropy is detected and adjusted by using a Bayesian framework; provides robust estimates in cross‐population analyses.	Computationally intensive due to requiring correlation matrices; assumes that all IVs have potential uncorrelated horizontal pleiotropy, while only small number of IVs have correlated horizontal pleiotropy; requires dozens of IVs to detect and delineate correlated horizontal pleiotropy; accurate LD reference panel is required relies on the quality and resolution of GWAS.
MR‐Horse (Grant and Burgess 2024)	Uses horse‐shoe prior and allows adaptation of MR models to specific scenarios by adjusting the global shrinkage parameter.	Does not rely on the InSIDE assumption, models both correlated and uncorrelated pleiotropy, using a Bayesian framework; LD structure is incorporated in the likelihood.	Bayesian inference with LD matrices and multivariable models is computationally intensive; models are sensitive to prior specifications; relies on the quality and resolution of GWAS.

Funding3

—European Union's Horizon grants
—ZonMw Klinische Fellow
—ZonMw Open Competitie

Keywords

causal inferencecausal networkcolocalizationMendelian randomization

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenetic Associations and Epidemiology · Advanced Causal Inference Techniques · Bioinformatics and Genomic Networks

Full text

Introduction

1

Genome‐wide association studies (GWAS) have identified tens of thousands of genetic associations with complex traits and diseases, revealing that these conditions are often associated with multiple independent common polymorphisms (Uffelmann et al. 2021). This approach has provided insights into the allelic architecture of complex traits and suggests potential genetic heterogeneity among affected individuals. However, up to 90% of GWAS‐identified variants lie in non‐coding regions with some of these associations affecting gene expression (Uffelmann et al. 2021). However, by and large, the functional impact of intergenic markers remains unclear. Moreover, GWAS signals typically include lead variants that tag many correlated SNPs in a specific genomic region and are not necessarily causal markers. Disease‐associated genetic loci often harbor multiple mechanistically plausible genes, many of which could be causal based on proximity alone; this makes it difficult to infer causal candidates from based on GWAS findings (Uffelmann et al. 2021).

In the context of post‐GWAS analyses, the term “causal gene” denotes the gene at an associated locus through which, by its function, sequence variation directly impacts trait etiology (Costanzo et al. 2025). Although “causal” or “causative gene” is commonly used in casual inference literature, it has been argued that “effector gene” may be a more appropriate term because (i) it does not suggest certainty or deterministic causality (Costanzo et al. 2025), which can rarely be proved using the high‐throughput technologies typically deployed by researchers, and (ii) because it is more precisely the sequence variations themselves that are causal, leading to observable changes in gene function with their etiological consequences (Costanzo et al. 2025). Whilst acknowledging this point, we use the term “casual gene” in this review for consistency.

To address inherent complexities of causal inference for candidate gene prioritization, an array of methods has been developed. These methods systematically integrate GWAS and multi‐omics data, including gene expression, metabolomics, and proteomics, thereby expediting biomarker profiling and discovery (Cui et al. 2022). Omics refers to a comprehensive study of biological molecules, which include genes, transcripts, proteins, methylation and metabolites (Song et al. 2020). Quantitative trait loci (QTL) are regulatory regions that contain genetic variation influencing the levels of various omics molecules, frequently through genetic and gene‐environment interactions (Powder 2020). QTL‐based causal inference methods are widely used and can be broadly classified as colocalization analyses, instrumental variable (IV) approaches, and network‐based causal frameworks (Chen et al. 2024; Zuber et al. 2022; Yin et al. 2024).

Establishing causality in complex biological systems requires a careful consideration of potential confounding factors. Colocalization analysis serves as a crucial preliminary step in a larger causal inference pipeline that establishes whether GWAS and QTL signals in a given genomic region share a single causal variant or result from multiple (separate) causal variants (Giambartolomei et al. 2014; Wu et al. 2019). By statistically determining the likelihood of shared causal markers, colocalization can facilitate selection of instrumental variables (IVs) for subsequent causal inference methods but also can be used to refine causal associations with effector genes (Zuber et al. 2022).

IV analysis, with Mendelian randomization (MR) being the most prominent example, utilizes genetic variants as uncounfounded instruments (IVs) to establish causal relationships between an exposure (such as gene expression) and an outcome (such as disease risk) (Chen et al. 2024). Unlike observational studies that are prone to residual confounding and reverse causation, MR exploits Mendel's law of random assortment of alleles at conception, thereby creating conditions similar to a randomized control trial (Chen et al. 2024). This principle allows MR to identify candidate genes by explicitly testing whether predicted changes in biomarker levels have a causal influence on a particular trait (Zhu et al. 2016).

It is crucial to differentiate MR from related but separate approaches such as transcriptome‐wide association studies (TWAS) (de Leeuw et al. 2023). While both methods leverage GWAS and multi‐omics data to prioritize candidate genes, TWAS surveys genes whose genetically predicted expression levels are associated with the trait (de Leeuw et al. 2023), using the same conceptual framework as MR but without formally acknowledging (or attempting to mitigate) any violations of IV assumptions. Generally, TWAS findings should not be interpreted as evidence of causality due to inherent limitations of this method, namely, genetic correlation, linkage disequilibrium and horizontal pleiotropy (de Leeuw et al. 2023). The role of TWAS in the broader causal inference framework, including MR, will be discussed later as part of the MR section of this review.

Beyond hypothesis‐driven approaches like MR, network‐based causal inference analyses offer an alternative discovery‐oriented framework for understanding complex biological systems (Yazdani et al. 2022). In particular, network‐based causal inference methods are able to reconstruct key regulatory hubs and causal pathways by inferring causal and mediatory relationships from high‐dimensional multi‐omics data (Yazdani et al. 2022). MR principles are embedded in the network construction with nodes representing different variables (including genetic variants) and edges denoting directional relationships (Yazdani et al. 2022).

In this review, we set out to give a comprehensive synthesis of state‐of‐the‐art causal inference methods and their application in gene prioritization. We discuss fundamental principles of MR, colocalization and network‐based causal inference approaches, outlining their strengths, limitations, and complementary roles. The objective of this paper is to guide readers in strategically combining these methodologies to enhance the accuracy and robustness of statistical causal inference, thereby fostering a more comprehensive perspective on gene prioritization in post‐GWAS analyses. Concluding each section, we present examples of methodologically sound applications of causal inference, contrasting them with exemplars of key methodological challenges encountered in causal reasoning research.

Colocalization

2

Historical Context and General Use Cases

2.1

Genetic markers that are located near each other on the genome are usually inherited together and are strongly correlated (i.e. are in linkage disequilibrium (LD)) (Zuber et al. 2022). LD presents a challenge when analyzing a large number of genetic associations, as two traits may be associated with distinct variants that are correlated with each other (Zuber et al. 2022). Colocalization methods were introduced to help address the lack of mechanistic clarity from GWAS and determine whether different traits (such as phenotypic outcome and gene expression) share causal markers (Zhang et al. 2022) (Table 1). Colocalization is often used in conjunction with MR analyses and can help evaluate the validity of MR assumptions in a given genetic locus. Colocalization assesses the overlap between causal variants for two or more traits by considering alternative scenarios at the locus: two traits with distinct causal variants in LD with each other vs a single shared association signal (colocalization) (Figure 1) (Hukku et al. 2021).

Five different colocalization scenarios. Panel A shows a scenario where a trait of interest and a QTL involve two distinct causal variants in a linkage disequilibrium. Panels B and C display colocalization. In panel B, a trait and QTL are independent but share a single common causal variant. Panel C shows a situation where a trait and QTL also share a single common causal variant, but the trait influences the QTL. Panel D and E display the situation where the trait and QTL have a shared causal variant in conjunction with distinct causal variants. Traditional colocalization methods cannot differentiate between situations where the trait and QTL are unrelated (B and D) and those where either the trait or QTL influence each other. Figure adapted from Zuber V et al. “Combining evidence from Mendelian randomization and colocalization: Review and comparison of approaches” (Zuber et al. 2022). The figure was produced in Biorender.

Core Approaches

2.2

Colocalization approaches can be subdivided into two major families: proportional and enumeration methods. Proportional colocalization tests the hypothesis that a sole causal variant accounts for genetic associations with distinct traits: proportionality of GWAS and eQTL regression coefficients supports this hypothesis, but can also indicate that the two traits belong to the same causal pathway (Zuber et al. 2022). Enumeration colocalization leverages a Bayesian framework to estimate the posterior probability of a trait and eQTL sharing a causal SNP. One advantage of the enumeration approach is that it only infers colocalization when there is strong evidence supporting it. In the absence of strong evidence, the posterior probabilities will be drawn towards the prior probabilities in order to minimize false positive associations (Zuber et al. 2022).

Relationships With Other Causal Inference Methods

2.3

Within the broader landscape of causal inference, colocalization holds a distinct yet complementary position alongside other established analytical approaches. When integrated with other methods, colocalization enhances specificity and mechanistic understanding of genetic associations with complex traits. Colocalization is commonly used to contextualize TWAS and MR signals and verify shared genetic basis between gene expression and specific trait (Zuber et al. 2022). Colocalization can also help validate specific components of causal networks (Aygün et al. 2023). The precise interplay between colocalization and other causal inference approaches will be discussed in later sections (Figure 2).

Interplay between different causal inference methods. Figure was produced in Biorender.

Recent Advancements in Colocalization

2.4

Some more recent developments in the area of colocalization include changes to the single causal variant assumption of the original Colocalization Tests of Two Genetic Traits (coloc) method (Giambartolomei et al. 2014), such that the causality of multiple variants within the same locus can be tested simultaneously (Wallace 2021) (Table 1). For example, sum of single effects (SuSiE) carries out fine‐mapping by breaking down genetic associations into the SuSiE that are later processed in a sparse model fitted for each trait separately (Wang et al. 2020; Zou et al. 2022). SuSiE separates the statistical evidence of association for each variant while conditioning on causal signal at the locus (Zou et al. 2022). By contrast, expression quantitative trait loci (eQTL) and GWAS Causal Variants Identification in Associated Regions (eCAVIAR) calculates posterior probabilities, using a model that allows an arbitrary number of causal markers in the locus and considers all possible causal SNP combinations (Hormozdiari et al. 2016). The Colocalization and Fine‐mapping in the presence of Allelic Heterogeneity (CAFEH) method implements a hierarchical Bayesian model to perform fine‐mapping and colocalization across multiple traits (Arvanitis et al. 2022a). Shared sparse Projection for colocalization analysis (SharePro) method combines LD modelling and colocalization by aggregating correlated markers into effect groups (Zhang et al. 2024) (Table 1).

Allelic heterogeneity (Wu et al. 2019), where multiple genetic markers within the same locus are associated with the same trait, presents a significant challenge to colocalization analysis. In the presence of allelic homogeneity, colocalization approaches that assume a single causal variant may erroneously infer colocalization. The presence of multiple causal variants at the locus can potentially reduce the precision of colocalization analyses by diluting the causal association signal and making it harder to pinpoint the exact genetic cause. Similarly, misspecification of prior enrichment levels has also been shown to contribute to spurious colocalization findings (Hukku et al. 2021). Therefore, it has been recommended that prior enrichment levels should be estimated from observed data, and fine‐mapping and colocalization analyses should be carried out separately (Hukku et al. 2021). Careful consideration of analytical factors, such as prior specification and model assumptions, as well as the use of complementary methods, including TWAS and MR, can improve the reliability of colocalization results (Al‐Barghouthi et al. 2022; Rasooly et al. 2023). Platforms such as Colocalization Quantitative Trait Loci Analysis (ColocQuiaL) and Easy Quantitative Trait Loci (ezQTL), further simplify and streamline colocalization analysis at scale **(**Zhang et al. 2022; Chen et al. 2022) (Table 1).

Practical Scenarios

2.5

Scenario 1—Colocalization Performing Well

2.5.1

The study by Franceschini et al. on carotid intima‐media thickness (CIMT) and carotid plague integrated GWAS with eQTLs across multiple human tissues (Franceschini et al. 2018). Gene regulation has been shown to be tissue‐specific (Arvanitis et al. 2022b) and impact the accuracy, interpretability and biological relevance of colocalization. Given that CIMT and plaque are pathologies of the arterial wall, strategic selection of arterial tissue as a target eQTL in colocalization analyses allowed the authors to directly link genetic associations to changes in expression in the primary affected tissue (Franceschini et al. 2018). Colocalization signals observed in disease‐relevant tissue supported biological interpretability and strengthened the evidence that candidate genes influenced atherosclerosis progression (Franceschini et al. 2018). Additional conditional analyses were employed to identify independently associated variants; this step ensured that colocalization signals weren't erroneously attributed to distinct genetic markers in LD (Franceschini et al. 2018). To better outline potentially causal associations, colocalization signals were further prioritized using functional annotation and linking CIMT and carotid plague to hard clinical endpoints such as CAD and various types of strokes (Franceschini et al. 2018). By establishing genetic correlations between CIMT and carotid plague and other cardiovascular outcomes, the study strengthened the argument that genes identified though colocalization were relevant for overall risk of severe cardiovascular events (Franceschini et al. 2018).

Scenario 2—Colocalization Producing Potentially Misleading Results

2.5.2

Shandrina and colleagues performed a study that sought to prioritize genes associated with coronary artery disease (CAD) (Shadrina et al. 2020). This investigation highlighted some of the limitations of colocalization analyses (Shadrina et al. 2020). The authors acknowledged that low statistical power and strict significance thresholds, especially in case of weak or marginally significant colocalization signals, might have contributed to false negative associations (Shadrina et al. 2020). Another limiting factor was that per SNP samples were not available and eQTL effect sizes were estimated from Z‐scores, which might have introduced additional variation and affected the accuracy of estimated effect sizes (Shadrina et al. 2020). Incomplete or suboptimal data can hinder the ability of colocalization to detect true causal signals, as the majority of current colocalization methods necessitate full summary statistics for both traits (King et al. 2021). In addition to incomplete summary statistics, another challenge of colocalization pertained to multiple causal associations and complex LD structure. As acknowledged by study authors, in scenarios where multiple causal variants were in a strong LD at particular locus, it was difficult for traditional colocalization methods to discern which variant was a source of colocalization signal (Shadrina et al. 2020).

Mendelian Randomization

3

Historical Context and General Use Cases

3.1

In biomedical research, genetic polymorphisms are often employed as IVs based on the principles of Mendelian genetics which assert that parental matings and the transfer of alleles from parent to the offspring occur randomly. Thus, individuals in the population can be divided into subgroups based on their genetic risk in a way that is comparable to randomization in randomized control trials (RCTs) (Hingorani and Humphries 2005). The MR approach was first proposed by Gray and Wheatley (Gray and Wheatley 1991), who developed this method in order to tackle some of the methodological issues associated with observational studies such as residual confounding and reverse causation (Chen et al. 2024; Sanderson et al. 2022; Khasawneh et al. 2022). Initially, MR was limited to individual‐level data which included genotypes, exposures and outcomes measured within the same dataset, also known as “one‐sample MR” (Zuber et al. 2022). Subsequent advancements included the incorporation of summary statistics (beta coefficients and their standard errors) from published GWAS, signifying genetic associations between exposure and outcome (Zuber et al. 2022). Introduction of this “two‐sample MR” approach enabled derivation of IVs associated with exposure and outcome from different datasets (Zuber et al. 2022), leading to widespread use of MR with thousands of MR studies currently published (de Leeuw et al. 2022a).

Core Approach

3.2

The validity of MR rests on three core assumptions (de Leeuw et al. 2022a) (Figure 3):

a.Relevance assumption: the IV is robustly associated with exposure either directly or through LD with a causal variant.
b.Independence assumption: There are no measured or unmeasured confounders influencing the IV or the outcome.
c.Exclusion restriction: the IV is not directly associated with an outcome but influences it only indirectly through the exposure. This assumption instrument strength is independent of the direct effect.

Overview of principles and core assumptions of Mendelian Randomization. Figure was produced in Biorender.

Additional MR assumptions include:

a.The Instrument Strength Independent of Direct Effect (InSIDE) assumption: IV‐exposure association is not correlated with any direct effects that IV has over the outcome (de Leeuw et al. 2022b).
b.Homogeneity assumption: The effects of IVs on the exposure should be consistent across different subgroups (Small et al. 2017).
c.No Measurement Error (NOME) assumption: There should be no measurement error in IVs (Bowden et al. 2016a).

MR can be implemented in several ways depending on available data and the studied exposure. In one‐sample MR a two‐stage least squares method can be used to assess the causal effect of a particular exposure (e.g. eQTL) on an outcome of interest (e.g. myocardial infarction). The classical MR Wald ratio test provides a causal estimate for a single IV by calculating the ratio of outcome and exposure beta coefficients (Burgess et al. 2017). Inverse‐variance weighted variance (IVW) test aggregates ratio estimates of several IVs in a fixed effects meta‐analysis, weighting each ratio by the inverse of its variance (Burgess et al. 2013; Bowden et al. 2019). In the presence of horizontal pleiotropy, which describes phenomena where a genetic variant (IV) is independently associated with both exposure and the outcome of interest, IVW (Burgess et al. 2013) estimates of this nature may be biased; hence, SNPs contributing to heterogeneity are usually excluded. Although alternative tests, such as weighted median (Bowden et al. 2016b), MR‐Egger (Bowden et al. 2015), and model‐based MR, are comparatively less well‐powered, they can evaluate the validity of IVs and produce more robust estimates of causal effect (Sanderson et al. 2022). See Table 2 for a listing of commonly used MR approaches that are described in this review.

MR has been extended to allow prioritization of functionally pertinent genes at genetic loci reported by GWAS. If a genetic variant impacts expression levels of a given gene (i.e. the gene is subject to an eQTL), varying expression levels will be detected among individuals with different genotypes, conveying a concept similar to overexpression/suppression experiments. Subsequently, if gene expression levels also have an influence on a trait, there will be observable phenotypic differences depending on genotype (Zhu et al. 2016). MR can thus be used analogously to test for the causal effects of gene expression on complex traits. However, the statistical power of MR to detect these causal associations depends on the proportion of outcome variance explained by the exposure, the proportion of exposure variance explained by the IVs and the overall sample size (Brion et al. 2013). Given the polygenicity of most complex human traits, the variance of gene expression or phenotype explained by a single polymorphism (SNP) is usually small, such that sample sizes exceeding tens of thousands of individuals may be necessary to identify a gene's causal effect using MR. Summary data‐based MR (SMR) analysis, introduced in 2016, aggregated summary data from independent GWAS and eQTL studies in order to ascertain genes whose expression levels showed associations with a trait because of pleiotropy (Zhu et al. 2016). LD can also bias SMR, especially when gene expression is used as exposure: strong cis‐eQTLs located near their transcripts are usually correlated. Both pleiotropy and LD are violations of the third MR principle.

In standard MR, the effect of gene expression (x) on trait (y) can, in the absence of non‐genetic confounders, be expressed as a ratio of the IV (z) effects on x and y, respectively:

[eqn]

In the presence of latent non‐genetic confounding and under assumption of either causality or horizontal pleiotropy, the performance of MR and SMR was shown to be equivalent (Zhu et al. 2016). Statistical power of SMR analyses increases significantly when GWAS data derived independently of the eQTL dataset are employed (Zhu et al. 2016). Within SMR, the “heterogeneity in dependent instruments” (HEIDI) test has been developed to distinguish between pleiotropy and LD. The latter scenario occurs when the top cis‐eQTL is in LD with two discrete causal variants— one influencing gene expression and the other affecting trait variance (Zhu et al. 2016). Under pleiotropy, where a causal variant influences both gene expression levels and trait, all genetic markers in LD with a causal variant are expected to have same b _ xy _ values as a causal variant. HEIDI assesses the heterogeneity of the SNPs' b _ xy _ values in the same cis‐eQTL region. Importantly, SMR utilizes a univariable single instrument MR approach and therefore is not able to discriminate between pleiotropy and causality.

Transcriptome‐Wide Association Studies and Mendelian Randomization

3.3

TWAS, while not a formal type of IV analyses, shares conceptual similarities with MR in that both methods use genetic variation to investigate links between molecular traits and phenotypes. TWAS represent a family of statistical methods that has been widely used to find associations between complex traits and gene expression and search for causal gene candidates by combining GWAS and gene expression predictors from eQTL cohorts (de Leeuw et al. 2023). Although TWAS findings are often interpreted as evidence of a genetic relationship between phenotype and gene expression, this interpretation is not congruent with a null hypothesis that TWAS tests (de Leeuw et al. 2023). TWAS models test for a relationship between the phenotype and the genetically predicted component of gene expression, ignoring any uncertainty in the predicted gene expression. This means that, instead of directly testing the relationship between gene expression and phenotype, TWAS effectively assesses genetic association between the phenotype and SNPs local to the gene, irrespective of any genetic relationship with the gene expression (de Leeuw et al. 2023). Additionally, the extent to which standard errors vary across genes depends on gene size and strength of the association between gene expression and trait; all of these factors complicate post‐hoc corrections (de Leeuw et al. 2023). TWAS misapplication leads to inflated type 1 error rates and spurious associations that can render up to 40% of significant results invalid (de Leeuw et al. 2023)

Alternative methods that could fulfill the intended role of TWAS include local genetic correlation analyses, MR and colocalization (de Leeuw et al. 2023). All of these approaches require different assumptions but can be used to evaluate local genetic associations between gene expression and different traits (de Leeuw et al. 2023). Standard methods for one‐ or two‐sample MR allow for the uncertainty in the prediction of exposure, while more advanced MR methods (Table 2) allow one to detect and discount some of the pleiotropic IVs that would ordinarily be used to predict the gene expression. Nonetheless, it must be acknowledged that measurement error is a persistent issue not only in TWAS but across all causal inference methods (de Leeuw et al. 2023).

Relationships With Other Causal Inference Methods

3.4

The foundational principles and diverse applications of MR extend to other causal inference methods, facilitating more robust and comprehensive understanding of genetic causality. This section examines these synergistic relationships, detailing how MR can be applied to enhance colocalization and TWAS (Figure 2). The relationship between MR and network‐based approaches will be discussed later.

Colocalization

3.4.1

MR and colocalization address different but related questions. While MR aims to determine the nature of relationship between exposure and outcome, colocalization assesses whether two traits are influenced by the same causal genetic variants – without needing to pre‐specify either of the traits as an exposure or outcome (Hukku et al. 2021). Although colocalization itself provides more robust mechanistic evidence of shared genetic etiology than simply observing statistical significance in the same genetic region for both traits (King et al. 2021), MR's causal framework can help to interpret colocalization results in a causal context. Integrative analyses combining MR and colocalization can highlight the genes involved in putative causal pathways (Liu et al. 2020). Colocalization identifies target genomic regions and causal variants, whereas MR estimates the overall causal effect of the candidate gene under an assumed direction of association (Zuber et al. 2022). Colocalization is often performed as a part of a broader causal inference pipeline and can be employed to validate IV assumptions prior to MR. By validating the shared genetic basis between GWAS loci and molecular traits, colocalization ensures integrity of genetic instruments. In cases where there is sufficient evidence that exposure and outcome are actually influenced by separate causal variants, it is unlikely that markers in that particular locus can be used as valid instruments for MR (Zuber et al. 2022). When both MR and colocalization provide concordant support, this significantly strengthens evidence confirming the causal role of the candidate gene in the trait development.

Transcriptome‐Wide Association Studies

3.4.2

As mentioned, TWAS primarily evaluate the joint association between trait and SNPs in the region, rather than directly testing the causal role of gene expression (de Leeuw et al. 2023). Despite this distinction, MR principles have been adapted for TWAS applications, specifically by employing a probabilistic MR likelihood framework that unifies many TWAS and MR methods (Yuan et al. 2020). Using this approach, TWAS associations are modelled with multiple correlated SNPs that are later tested in an MR likelihood model explicitly accounting for horizontal pleiotropy (Yuan et al. 2020). Probabilistic MR methods use the burden test for horizontal pleiotropy modelling assumption and generalize MR‐Egger regression to multiple correlated IVs, thereby controlling for both correlated and uncorrelated pleiotropic effects (Yuan et al. 2020).

Recent Advancements in Mendelian Randomization

3.5

Multi‐instrument and multivariable MR has been successfully adapted to gene expression exposures by tailoring the inverse‐variance weighted method to GWAS and eQTL summary statistics, and assessing the multivariate causal effect of all genes at the given locus (Porcu et al. 2019a). This particular approach necessitates inclusion of all potential sources of measured pleiotropy in the model (Porcu et al. 2019a; Burgess and Thompson 2015). Other approaches that correct for pleiotropy have either entirely excluded pleiotropic IVs from models (Zhu et al. 2016; Verbanck et al. 2018; Zhu et al. 2018) or required exposures and outcomes to be derived from the same dataset (Berzuini et al. 2020a)—but both approaches risk loss of information and decrease in statistical power. Capturing all sources of pleiotropy is not always feasible, especially when the source originates from a gene in a different tissue or other unmeasured biomarkers or phenotypes (van der Graaf et al. 2020).

Several novel MR methods were developed in the last 5 years to address these methodological constraints. It has been demonstrated that a major source of pleiotropy in transcriptome‐wide MR comes from eQTLs of genes that are in LD with the primary IVs (van der Graaf et al. 2020). To address this issue, Mendelian randomization with Linkage Disequilibrium and Pleiotropy (MR‐link) employs eQTL summary statistics and individual‐level data for the exposure and outcome, respectively, and makes a casual inference about the effect of exposure while correcting for LD and unobserved pleiotropy. MR‐link does not require removal of pleotropic IVs and instead jointly analyses genetic markers that are in LD with exposure eQTLs (van der Graaf et al. 2020) (Table 2).

Traditional MR methods tackle correlated horizontal pleiotropy by assuming that it affects all IVs and estimating how much shared heritable factors contribute to IV‐outcome relationships (Morrison et al. 2020; Xue et al. 2021; Wang et al. 2021). The effective‐median‐based MR framework (EMIC) identifies false‐positives due to LD and/or correlation between IVs and uses multiple cis‐eQTLs to perform pleiotropy fine‐mapping (Jiang et al. 2022). In contrast to alternative methods, EMIC employed eigenvalue decomposition matrix estimated from ancestry matched reference panel, making it robust to the LD noise and discrepancies (Jiang et al. 2022) (Table 2).

The issue of IV invalidity due to correlated pleiotropy has also been tackled by introduction Bayesian framework to MR. Traditional frequentist methods that assess the violation of INSIDE assumption, rely on fulfilment of specific strict conditions and often suffer from inflated type I error rates (Morrison et al. 2020). Bayesian framework for MR was first introduced in a seminal work by Berzuini et al. who proposed a horseshoe prior to address the invalidity of IVs (Berzuini et al. 2020b). However, this particular method still relied on INSIDE assumption and required individual‐level data (Berzuini et al. 2020b), with subsequent improvements extending to summary data‐based Bayesian MR (Zhao et al. 2020; Bucur et al. 2020). Other Bayesian methods, such as Mendelian randomization with Correlated Horizontal Pleiotropy) (MR‐Corr^2^), tackled correlated pleiotropy by incorporating a correlation parameter in the model (Cheng et al. 2022a). Nonetheless, MR‐Corr^2^ was limited by assumption that pleiotropic effects were either entirely correlated or uncorrelated with IV effects on exposure (Cheng et al. 2022a). By contrast, Causal Analysis Using Summary Effect estimates (CAUSE) (Morrison et al. 2020)and Correlated horizontal pleiotropy Unravelling shared Etiology and confounding (MR‐CUE) (Cheng et al. 2022b), considered both correlated and uncorrelated pleiotropic effects, avoiding reverse causation and mitigating weak instruments bias (Cheng et al. 2022b). MR‐CUE mapped IVs with correlated horizontal pleiotropy to their respective cis‐associated genes and enriched pathways, informing shared genetic etiology between exposure and outcome (Cheng et al. 2022b). However, CAUSE and MR‐Cue that the association of IVs with exposure was either entirely due to confounders or solely because of a direct effect (Morrison et al. 2020; Cheng et al. 2022b) Recently developed MR‐Horse does not rely on INSIDE assumption, handles both correlated and uncorrelated pleiotropy, and retains low rates of type I error under wide range of different circumstances (Grant and Burgess 2024). MR‐Horse allows to adapt MR models to specific scenarios by adjusting the global shrinkage parameter (Grant and Burgess 2024).

Practical Scenarios

3.6

Scenario 1—Mendelian Randomization Performing Well

3.6.1

Li et al. performed proteome‐wide MR analyses systematically investigating causal associations between 2490 circulating proteins 19 different cardiovascular diseases (CVDs) (Li et al. 2025). Instead of focusing on single protein‐disease associations, this compressive approach facilitated thorough exploration of protein‐mediated causal pathways across diverse CVD pathologies (Li et al. 2025). The analyses aggregated summary statistics from the largest published proteome dataset and meta‐analysis of GWAS on CVDs in European and Asian populations (Li et al. 2025). Integration of large well‐characterized datasets increased statistical power for detecting genuine causal associations (Li et al. 2025). Genetic instruments were derived separately for European and East‐Asian samples and analyzed with CVDs from corresponding ancestry GWAS (Li et al. 2025). This multi‐ancestry approach allowed to control for ancestry‐specific alleles, improving the accuracy of causal estimates and generalizability of study findings (Li et al. 2025). Bi‐directional MR was implemented to differentiate causal candidate proteins from reverse causality, a concern that was highlighted in previous MR investigations, that found a partial overlap between molecular causes and consequences of the disease (Li et al. 2025). MR identified 218 proteins that causally influenced the risk of one or several CVDs; of them 111 were novel associations (Li et al. 2025). Results of forward and reverse MR detected only 2 overlapping signals, which indicated that proteomic causes and consequences of CVDs were largely distinct (Li et al. 2025). Interestingly, among 15 protein biomarkers whose expression levels were influenced by CVD status, 5 were specific to East‐Asian populations which showcased the value of a multi‐ancestry MR approach (Li et al. 2025). Genes exclusive to East‐Asian ancestry among other things had cardioprotective effects on cardiomyocyte survival and stress‐induced angiogenesis (Hedhli et al. 2014) and were implicated in overweight‐related hypertension in Han‐Chinese (Zhu et al. 2021a). Candidate causal proteins for CVDs included drug targets, namely, ANGPTL3, ECE1 and PCSK9 inhibitors, that were already approved for CVD treatment or in development, thereby exemplifying successful application of MR in pharmacogenomics (Li et al. 2025). Among novel gene candidates, BTN3A2 exhibited a relatively large causal effect on the risk of ischemic stroke; this association was consistent across different datasets and analyses with a crosstalk between immunomodulation and stroke emerging as a likely mechanism (Li et al. 2025). MR utilizing single‐cell RNA sequencing showed causal associations between PAM and LPL enrichment in cardiomyocytes and the risk of angina pectoris, ventricular arrythmia and peripheral artery diseases, respectively (Li et al. 2025). A study Li and colleagues showcased the potential of leveraging human genetics and MR for full‐scale novel drug discovery (Li et al. 2025).

Scenario 2—Mendelian Randomization Producing Potentially Misleading Results

3.6.2

A study by Wu et al. investigated potential drug targets for MI by integrating European MI GWAS and blood plasma proteome data (Wu et al. 2023). Although the study identified LPA and APOA5 as putative causal genes, there were several limitations pertaining to MR analysis (Wu et al. 2023). In MR, blood plasma cis‐pQTLs were used as exposure variables, and MI was the outcome (Wu et al. 2023). Associations between MI and blood plasma proteins represented by a single pQTL were analyzed using the Wald ratio test, whereas proteins with multiple IVs were assessed by the IWV test (Wu et al. 2023). Although both approaches were valid, the Wald‐ratio test in particular severely limited the options for MR sensitivity analyses, such as MR‐Egger regression or heterogeneity test (Wu et al. 2023). Conversely, the IVW test required that the sum of horizontal pleiotropic effects across all IVs was equal to zero, assuming that strength of IVs was also not related to pleiotropy (Burgess et al. 2013). The study did not formally test for pleiotropy, which meant that one of the core MR assumptions ‐ that IV affects the outcome through exposure‐ could not be verified (Wu et al. 2023). Even though cis‐pQTLs were believed to be less vulnerable to pleiotropy compared to trans‐pQTLs (Zheng et al. 2020), they still could affect neighboring genes, as seen in studies of gene expression (Zheng et al. 2020). Additionally, MR analyses, where genetic instruments were restricted to a single region, not only exhibited reduced statistical power but also inflated type I error rates (van der Graaf et al. 2025). Inclusion of non‐pleiotropic trans‐pQTL in MR analyses could potentially improve the consistency of protein‐trait associations by increasing the variance explained by target proteins (Zheng et al. 2020). Moreover, this approach would ensure that casual estimates were not reliant on a single genetic locus and additional sensitivity analyses could be performed (Zheng et al. 2020). By instrumenting blood plasma protein levels, Wu et al. selected a tissue that might not have been relevant for the development of MI (Wu et al. 2023). Mismatches between tissues where protein exposures were measured and disease‐relevant pQTLs can lead to spurious associations (Porcu et al. 2019b). Overall, the investigation by Wu and colleagues exemplifies how availability of rich and diverse QTL data is currently one of the major obstacles in causal inference research.

Network‐Based Approaches

4

Historical Context and General Use Cases

4.1

Although MR has gained popularity as a method for causal inference in recent years, limited options exist for its use in the systemic interrogation of omic data (Yazdani et al. 2022). Indeed, its hypothesis‐driven approach does not consider complex relationships and interactions between diverse types of molecules that may contribute to disease development; even when multiple exposures are considered (as in multivariable MR), the goal is to assess individual causes (Yazdani et al. 2022). Furthermore, limited prior knowledge about the relationships between different genetic/environmental risk factors makes it challenging to delineate explanatory, mediating and response variables using MR.

In contrast to MR, network‐based analyses can be used to systematically integrate genomic and multi‐omics data, attempting to infer causal structures (Liu et al. 2009; Mezlini and Goldenberg 2017); as such, these approaches are better suited for multi‐omics data typically characterized by high dimensionality, interdependency between layers and pleiotropy. The origins of network‐based causal inference approaches can be traced to the 20th century when Sewall Wright laid foundations for structural equation modelling aimed at describing relationships between different variables (Niles 1922). So‐called causal networks (CNs) provide structured discovery‐based analyses of complex data where each entity, represented by nodes in the network, can be an explanatory variable, mediator or a response (Yazdani et al. 2022). While MR employs genetic variants as IVs to isolate causal effects in observational studies, network‐based approaches use directed acyclic graphs to model conditional dependencies among variables (Figure 4) (Howey et al. 2020).

Schematic representation of network‐based approaches for causal inference. Genetic variants can be used as anchors in causal networks to help orient the edges (dashed lines) between different nodes, here represented by different risk factors. Figure was produced in Biorender.

Core Approach

4.2

It is important to remember that MR principles are embedded in the causal framework of network‐based models. One of the advantages of network‐based approaches over MR is that they can handle large datasets such as omics measurements. MR approaches can be used to help build CNs depending on the type of available omics data (Yazdani et al. 2022). Associations identified in MR can be incorporated as “genetic anchors” within the causal network, guiding and constraining the network structure (Howey et al. 2020). This way MR helps improve the accuracy and biological plausibility of inferred relationships by grounding them in strong causal evidence that is less affected by confounding compared to purely observational associations (Yazdani et al. 2022). QTLs can be included as IVs for genes whose levels are coded by one or two genetic markers, whereas polygenic factors may explain a larger proportion of phenotypic variation and show stronger associations with other explanatory variables in the model (Yazdani et al. 2022). The polygenic approach provides additional benefits over MR, such as reduction in spurious and highly sensitive estimates as well as enhanced statistical power by assigning multiple IVs to explanatory variables (Yazdani et al. 2022). Bayesian networks (BN) are probabilistic graphical models signifying a set of entities and conditional relationships between them as directed acyclic graphs (DAGs) (Puga et al. 2015). Independent variables and probabilistic dependencies in the DAG are represented by nodes and edges (Puga et al. 2015). The conditional probability distribution of each node quantifies the effects of its parent nodes, allowing the interpretation of edges as causal associations between exposure and outcome (Puga et al. 2015). BNs with genetic anchors can infer causal relationships between multiple genetic and non‐genetic risk factors at the same time, revealing complex causal networks, particularly in the context of horizontal pleiotropy (Howey et al. 2020).

Relationships With Other Causal Inference Methods

4.3

Network‐based analyses provide a causal inference framework that can complement and improve utility and interpretability of colocalization, Mendelian randomization and TWAS analyses by integrating their findings into a cohesive, systemic view of biological processes (Figure 2).

Colocalization

4.3.1

Network‐based causal inference methods can foster synthesis of colocalization evidence across multiple omic layers by jointly modelling gene products and improving causal variant resolution (Okamoto et al. 2025). This multi‐layered approach helps to discern whether an observed colocalization signal truly reflects shared causality or is confounded by other regulatory mechanisms. Causal networks can link genetic variants through various molecular intermediates, ultimately revealing a more complete mechanistic milieu (Agamah et al. 2022). Placing colocalization signals in a broader etiological context shows how shared genetic markers influence biological systems through involvement with upstream regulators, downstream effectors and other regulatory mechanisms. When colocalization points to a shared genetic basis between two nodes in the network, the network structure can inform interactions with other molecules or pathways implicated in shared genetic mechanisms (Barrio‐Hernandez et al. 2023).

Mendelian Randomization

4.3.2

Foundational MR principles can be embedded in the construction of causal networks by implementing methodological advancements that extend the core logic of using genetic variants as IVs (Yazdani et al. 2022) Many of network‐based causal inference methods use a two‐step approach, whereby they initially draw a totality of causal influences among traits and later refine these general connections by pinpointing direct causal relationships in the network (Lin et al. 2023). In this initial step, extended MR methods are employed to infer all possible causal connections without distinguishing between immediate effects and intermediate associations (Lin et al. 2023).

Causal networks expand MR principles to encompass multiple exposures and outcomes, thereby moving beyond the hypothesis‐driven paradigm of MR (Yazdani et al. 2022). The discovery‐based approach implemented in network‐based causal inference enables mapping of causal pathways and cascades that include numerous complex interactions (Lin et al. 2023). Traditional MR mediation analyses typically focus on one mediator in a specific causal pathway, and while extensions exist (Yang et al. 2024a), they can become unwieldy when multiple mediators are considered. In contrast to MR, causal networks assess the overall architecture of mediation rather than isolated mediation paths by quantifying contributions of multiple intermediate variables and indirect effects (Lin et al. 2023). Network‐based causal inference methods are able to isolate direct effects by applying graph deconvolution to the network of total effects (Lin et al. 2023; Feizi et al. 2013). Additionally, by analyzing conditional dependencies and the overall structure of the network, network‐based causal inference methods can detect and differentiate various forms of genetic confounding (Yazdani et al. 2022; Amar et al. 2021), whereas MR often operates under simplified assumptions and requires additional sensitivity analyses (Carter et al. 2021).

Transcriptome‐Wide Association Studies

4.3.3

Although TWAS can inform on the relationship between gene expression and phenotype at the genomic region level, they overlook complex gene‐gene interactions and interdependencies and are unable to distinguish direct causal effects from downstream effects within the same pathway (Subirana‐Granés et al. 2025; Pividori et al. 2023). Additionally, as mentioned previously, TWAS findings can be confounded by factors such as LD and pleiotropy which may result in false‐positive associations (de Leeuw et al. 2023). Network‐based approaches offer structural context for TWAS‐reported genes and refine their prioritization by assessing their position and connectivity within a wider biological network (Kaushal and Singh 2020; Jin et al. 2022). This enables a more accurate detection of causal genes by embedding them in the framework that considers their interdependencies (Jin et al. 2022). When a broader regulatory topology and interactions between genes are considered, genetic confounding arising from pleiotropy and LD can also be minimised (Sun et al. 2024; Zhu et al. 2021b). Additionally, network‐based causal inference methods can be used to forecast genetic contribution of gene expression by integrating distal genetic variants that operate through gene regulatory networks, consistent with omnigenic model of trait inheritance (Mohammad and Michoel 2024), Using this approach, Bayesian networks capture causal and coexpression relationships between genes and improve gene expression prediction accuracy which is crucial for TWAS (Mohammad and Michoel 2024).

Recent Advancements in Network‐Based Methods

4.4

BNs address some of MR's shortcomings, particularly when handling high‐dimensional datasets with multiple genetic predictors and potential risk factors (Howey et al. 2020). The advantages of BNs have been demonstrated in situations with widespread pleiotropy, where this method outperformed bi‐directional MR in terms of type I error control and statistical power (Howey et al. 2020). BNs have been successfully used to scan for disease‐associated genes across a wide range of complex disorders like breast cancer, bone metastasis (Park et al. 2018) and Alzheimer's (Sherif et al. 2015). For gene regulatory network inference, hybrid methods integrating differential equations with dynamic Bayesian networks have been developed. The Differential Equation based Local Dynamic Bayesian Network (DELDBN) algorithm demonstrated a notable enhancement in accuracy and scalability when applied to yeast and human cell data, successfully reconstructing known interactions around the BRCA1 gene (Li et al. 2011). In analyses of protein sequences, feature extraction methods using Bayesian networks and Dirichlet distributions have been employed for causal inference (Liu et al. 2009). Additionally, recent developments like the Causal Inference Using Composition of Transactions (CICT) method has shown promising results in uncovering gene regulatory networks from single‐cell RNA‐seq data, outperforming existing network inference methods (Shojaee and Huang 2023). For situations with scarce data, knowledge transfer techniques that utilize data from other sources have been suggested (Rodríguez‐López and Sucar 2022). To ensure accurate and reliable BN analysis, missing information can be imputed using the nearest neighbor approach (Howey et al. 2021) or survival tree analysis (Rancoita et al. 2016).

Violations of the core MR assumption result in unstable networks. When directional anchors are not included in BNs, several issues can arise, including poor accuracy, confounding and pleiotropy (Xue et al. 2021) all of which contribute to less stable networks. The assessment of network stability involves an array of methodologies. In bootstrapping, the variability of estimated network parameters is evaluated by resampling of the data with subsequent replacement (Dikopoulou 2021). Sensitivity analyses test how systemic variations in inputs and parameters of the models affect network stability (Yazdani et al. 2022). The performance of network‐based approaches under different conditions, such as sample size and data noisines,s can be assessed in simulation studies (Yazdani et al. 2022; Dikopoulou 2021)

BNs augmented with principles of MR provide a valuable complementary approach to existing causal inference methods, particularly in the context of modern high‐dimensional omics (Howey et al. 2020; Evans and Davey Smith 2015). Ongoing research efforts continue to improve BNs scalability and applicability to large‐scale genomics (Ji et al. 2015; Larjo et al. 2013).

Practical Scenarios

4.5

Scenario 1—Network‐Based Approaches Performing Well

4.5.1

A study conducted by Zeng et al. exemplifies how network‐based causal inference analyses can be successfully utilized to uncover genetic variance and causal mechanisms underlying CAD (Zeng et al. 2019). This particular investigation sought to assess heritability contributions of SNPs associated with gene‐expression (e‐SNPs) in CAD‐related gene‐regulatory networks. Seven vascular and metabolic eQTL datasets from patients with CAD were used to detect e‐SNPs and build co‐expression networks (Zeng et al. 2019). Prior information from co‐expression network eQTLs was utilized in the Bayesian algorithm that inferred gene regulatory networks (GRNs) with causal links to CAD (Zeng et al. 2019). This approach allowed the detection not only of disease‐associated putatively causal genes but also of entire etiological pathways. By incorporating the genetics of gene expression, causal networks revealed directionality of gene interactions and identified hub genes that functioned as key network drivers, regulating downstream genes (Zeng et al. 2019). The causal role of these “hub” genes was further supported by in vitro models that demonstrated changed activity of entire GRNs active in CAD and impact on CAD phenotype (Zeng et al. 2019). Importantly, among all analyzed tissues, fat and arterial wall exhibited the strongest influence of the risk of CAD (Zeng et al. 2019). GRNs involved, among known CAD risk factors, RNA metabolism, DNA binding and blood coagulation mechanisms. Moreover, Zang and colleagues determined that e‐SNPs underpinning gene regulatory networks explained an additional 10% of CAD heritability not explained by GWAS (Zeng et al. 2019). Twenty‐eight GRNs were replicated in an independent dataset which underscored the robustness of the study findings (Zeng et al. 2019).

Scenario 2—Network‐Based Approaches Producing Potentially Misleading Results

4.5.2

Bowles and colleagues employed the BN approach in the analyses of modifiable and non‐modifiable cardiovascular risk factors (Bowles et al. 2024). The study determined that beyond traditional risk factors, such as BMI and physical activity, ethnicity and exposure to heavy metals were the strongest drivers of CVD Blood levels of heavy metals mediated the relationship between ethnicity and cardiovascular disease (Bowles et al. 2024). Although the study demonstrated the utility of using BNs for understanding complex interactions between different variables influencing CVD risk, there were several notable limitations that could affect the accuracy of the results and interpretation of causal associations (Bowles et al. 2024). For example, ascertaining directions of the edges in the network relied on specific assumptions. The edge was included in the network if it showed up between the two nodes in at least 51% of bootstrapped networks (Bowles et al. 2024). Consequently, this approach assumed that the relationships between variables were stable under bootstrapping conditions (Bowles et al. 2024). Despite statistical support provided by the BN model, inferred causal directions might not fully capture true biological causality without additional sensitivity analyses evaluating model assumptions (Bowles et al. 2024). Additionally, even with the implementation of BNs, concurrent study design meant that the possibility of reverse causation and bidirectional relationships could not be completely ruled out (Bowles et al. 2024). Longitudinal, experimental and quasi‐experimental methods could be adopted as robust alternatives, ensuring the integrity of BN modelling.

Future Perspectives

5

Integration of methods from the causal inference literature into genomic studies has shown great promise in unravelling complex biological relationships, with techniques like MR, and colocalization as well as network‐based approaches identifying potential causal genes and furthering the understanding of genetic architecture of complex traits and diseases (van der Graaf et al. 2020; Howey et al. 2020; Zhao et al. 2024; Yang et al. 2024b). However, as mentioned, these methods face challenges in the presence of factors such as pleiotropy, LD, and genetic confounding. TWAS and other eQTL‐based methods, in particular, may prioritize multiple genes at a locus, some of which may be non‐causal due to shared eQTLs (Wainberg et al. 2019). To address these issues, novel approaches like MR‐link and causal‐TWAS (cTWAS) have been developed, which account for unobserved pleiotropy and genetic confounding (van der Graaf et al. 2020; Zhao et al. 2024).

Looking ahead, the integration of multiple data types and the development of more sophisticated statistical frameworks will be crucial. The advent of methods combining colocalization and MR analysis for loci with allelic heterogeneity represents a promising step in this direction (Zhu et al. 2021c). Additionally, the incorporation of tissue‐specific eQTL data and the consideration of context‐dependent effects will be important for improving the accuracy of causal inference (Wainberg et al. 2019).

Conflicts of Interest

The authors declare no conflicts of interest.

Bibliography106

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Agamah, F. E. , J. R. Bayjanov , and A. Niehues , et al. 2022 November 14. “Computational Approaches for Network‐Based Integrative Multi‐Omics Analysis.” Frontiers in Molecular Biosciences 9: 967205. https://www.frontiersin.org/journals/molecular-biosciences/articles/10.3389/fmolb.2022.967205/full.36452456 10.3389/fmolb.2022.967205 PMC 9703081 · doi ↗ · pubmed ↗
2Al‐Barghouthi, B. M. , W. T. Rosenow , K. P. Du , et al. 2022 November 23. “Transcriptome‐Wide Association Study and e QLT Colocalization Identify Potentially Causal Genes Responsible for Human Bone Mineral Density Gwas Associations.” e Life 11: e 77285.36416764 10.7554/e Life.77285 PMC 9683789 · doi ↗ · pubmed ↗
3Amar, D. , N. Sinnott‐Armstrong , E. A. Ashley , and M. A. Rivas . 2021 January 13. “Graphical Analysis for Phenome‐Wide Causal Discovery in Genotyped Population‐Scale Biobanks.” Nature Communications 12, no. 1: 350.10.1038/s 41467-020-20516-2PMC 780664733441555 · doi ↗ · pubmed ↗
4Arvanitis, M. , K. Tayeb , B. J. Strober , and A. Battle . 2022 a February 3. “Redefining Tissue Specificity of Genetic Regulation of Gene Expression in the Presence of Allelic Heterogeneity.” American Journal of Human Genetics 109, no. 2: 223–239.35085493 10.1016/j.ajhg.2022.01.002PMC 8874223 · doi ↗ · pubmed ↗
5Arvanitis, M. , K. Tayeb , B. J. Strober , and A. Battle . 2022 b February 3. “Redefining Tissue Specificity of Genetic Regulation of Gene Expression in the Presence of Allelic Heterogeneity.” American Journal of Human Genetics 109, no. 2: 223–239.35085493 10.1016/j.ajhg.2022.01.002PMC 8874223 · doi ↗ · pubmed ↗
6Aygün, N. , D. Liang , W. L. Crouse , G. R. Keele , M. I. Love , and J. L. Stein . 2023 May 30. “Inferring Cell‐Type‐Specific Causal Gene Regulatory Networks During Human Neurogenesis.” Genome Biology 24, no. 1: 130.37254169 10.1186/s 13059-023-02959-0PMC 10230710 · doi ↗ · pubmed ↗
7Barrio‐Hernandez, I. , J. Schwartzentruber , A. Shrivastava , et al. 2023 March “Network Expansion of Genetic Associations Defines a Pleiotropy Map of Human Cell Biology.” Nature Genetics 55, no. 3: 389–398.36823319 10.1038/s 41588-023-01327-9PMC 10011132 · doi ↗ · pubmed ↗
8Berzuini, C. , H. Guo , S. Burgess , and L. Bernardinelli . 2020 a January 1. “A Bayesian Approach to Mendelian Randomization With Multiple Pleiotropic Variants.” Biostatistics 21, no. 1: 86–101.30084873 10.1093/biostatistics/kxy 027PMC 6920542 · doi ↗ · pubmed ↗