# LiMA: Robust inference of molecular mediation from summary statistics

**Authors:** Kaido Lepik, Chiara Auwerx, Marie C. Sadler, Adriaan van der Graaf, Sven Erik Ojavee, Zoltán Kutalik

PMC · DOI: 10.1016/j.ajhg.2025.12.005 · 2026-01-08

## TL;DR

LiMA is a new method that improves the accuracy of identifying molecular mediators in causal relationships between risk factors and complex traits.

## Contribution

LiMA introduces a likelihood-based framework that jointly models variability in summary statistics, reducing bias and false positives in mediation analysis.

## Key findings

- LiMA achieves several-fold lower bias and better type I error control compared to existing methods in simulations.
- Real data applications identified metabolites like glutamate and carnitine, and proteins mediating obesity-related cardiometabolic risk.
- LiMA accommodates variability in summary statistics, enabling robust mediation analysis across large mediator sets.

## Abstract

Understanding the molecular mechanisms mediating the causal effects of epidemiological risk factors on complex traits can advance targeted disease interventions. Statistical mediation analysis facilitates this by disentangling direct and indirect causal effects. Current approaches to causal mediation leverage Mendelian randomization, using summary statistics from the exposure, mediator, and outcome studies that estimate the genetic effects of instruments. However, differences in study sample sizes (measurement errors) lead to substantial biases and poorly controlled type I error rates for these methods, which become especially pronounced when simultaneously estimating the mediation proportion of numerous mediators. To address these limitations, we introduce Likelihood-based Mediation Analysis (LiMA), which estimates molecular mediation more accurately and robustly by jointly modeling the variability in all estimates involved. Through extensive simulation studies and benchmarking, we demonstrate that our approach achieves several-fold lower bias and improved control for type I error than state-of-the-art methods. Applying our method to real data highlighted several plausible metabolites—such as glutamate and carnitine—as well as proteins mediating the causal effects of obesity-related risk factors on cardiometabolic outcomes. These findings underscore the potential of our framework to reveal promising molecular pathways underlying complex diseases. By accommodating the variability inherent to summary statistics of varying precision, LiMA enables robust mediation analyses across large sets of mediators.

LiMA and its random-effect variant I-LiMA infer molecular mediation in a Mendelian randomization framework using summary statistics. Jointly modeling direct and mediated effects while accounting for measurement error, they reduce weak-instrument bias and false positives in simulations. Applications to real data reveal metabolites and proteins mediating obesity-related cardiometabolic risk.

## Linked entities

- **Chemicals:** glutamate (PubChem CID 611), carnitine (PubChem CID 288)
- **Diseases:** obesity (MONDO:0011122)

## Full-text entities

- **Diseases:** obesity (MESH:D009765)
- **Chemicals:** glutamate (MESH:D018698), carnitine (MESH:D002331)

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12824626/full.md

---
Source: https://tomesphere.com/paper/PMC12824626