# Causal Forests Versus Inverse Probability of Treatment Weighting to Adjust for Cluster‐Level Confounding: A Parametric and Plasmode Simulation Study Based on US Hospital Electronic Health Record Data

**Authors:** Mike Du, Stephen Johnston, Paul M. Coplan, Victoria Y. Strauss, Sara Khalid, Daniel Prieto‐Alhambra

PMC · DOI: 10.1002/pds.70257 · Pharmacoepidemiology and Drug Safety · 2025-11-03

## TL;DR

This study compares two statistical methods for reducing bias in observational studies of implantable devices, finding that Causal Forests outperform Inverse Probability of Treatment Weighting when cluster-level confounding is strong.

## Contribution

The paper introduces a novel comparison of Causal Forests and IPTW for handling cluster-level confounding in real-world data.

## Key findings

- Causal Forests reduced bias more effectively than IPTW under strong cluster-level confounding.
- Performance differences were most notable when surgeon influence on treatment allocation was high.
- Results were consistent across parametric and plasmode simulations based on real hospital data.

## Abstract

Rapid innovation and new regulations increase the need for post‐marketing surveillance of implantable devices. However, complex multi‐level confounding related to patient‐level and surgeon or hospital covariates hampers observational studies of risks and benefits. We conducted two simulation studies to compare the performance of Causal Forests (CF) versus Inverse Probability of Treatment Weighting (IPTW) to reduce confounding bias in the presence of strong surgeon impact on treatment allocation.

Two Monte Carlo simulation studies were carried out: (1) Parametric simulations with patients nested in clusters (ratio 10:1, 50:1, 100:1, 200:1, 500:1) and sample size n = 10 000 were conducted with patient and cluster level confounders; (2) Plasmode simulations generated from a cohort of 9981 patients admitted for pancreatectomy between 2015 and 2019 from the US PINC AT hospital research database. Different CF algorithms and IPTW were used to estimate binary treatment effects.

Performance varied with the strength of cluster‐level confounding. Under weak to moderate surgeon influence, CF and IPTW performed similarly. When confounding was strong (OR = 2.5), CF reduced bias compared with IPTW: in parametric simulations, relative bias averaged 11.2% for CF versus 19.9% for IPTW, with similar advantages observed in plasmode simulations.

CF shows promise as a method for estimating treatment effects in scenarios where cluster‐level confounding strongly impacts treatment allocation. More research is needed to guide its use.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12583492/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12583492/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/PMC12583492/full.md

---
Source: https://tomesphere.com/paper/PMC12583492