# Estimating average causal effects with incomplete exposure and confounders

**Authors:** Lan Wen, Glen McGee

PMC · DOI: 10.1515/jci-2023-0083 · Journal of Causal Inference · 2026-02-20

## TL;DR

This paper introduces new methods to estimate causal effects when data on exposures and confounders are missing, improving accuracy in observational studies.

## Contribution

The paper proposes novel doubly robust estimators for handling missing not at random data in causal effect estimation.

## Key findings

- Standard multiple imputation techniques can be biased under MNAR assumptions.
- Doubly robust TMLE estimators remain unbiased even if some models are misspecified.
- The methods were applied to study opioid effects on mortality using NHANES data.

## Abstract

Standard methods for estimating average causal effects require complete observations of the exposure and confounders. In observational studies, however, missing data are ubiquitous. Motivated by a study on the effect of prescription opioids on mortality, we propose methods for estimating average causal effects when exposures and potential confounders may be missing. We consider missingness at random and additionally propose several specific missing not at random (MNAR) assumptions. Under our proposed MNAR assumptions, we show that the average causal effects are identified from the observed data and derive corresponding influence functions, which form the basis of our proposed estimators. Our simulations show that standard multiple imputation techniques paired with a complete data estimator is unbiased when data are missing at random (MAR) but can be biased otherwise. For each of the MNAR assumptions, we instead propose doubly robust targeted maximum likelihood estimators (TMLE), allowing misspecification of either (i) the outcome models or (ii) the exposure and missingness models. The proposed methods are suitable for any outcome types, and we apply them to a motivating study that examines the effect of prescription opioid usage on all-cause mortality using data from the National Health and Nutrition Examination Survey (NHANES).

## Full-text entities

- **Genes:** PELP1 (proline, glutamate and leucine rich protein 1) [NCBI Gene 27043] {aka MNAR, P160}
- **Diseases:** MI (MESH:C580424), A L. (MESH:D007926), Death (MESH:D003643), opioid overdoses (MESH:D000083682), chronic pain (MESH:D059350), TMLE-B. (MESH:D006509), addiction (MESH:D019966), pain (MESH:D010146), drug misuse (MESH:D009293)
- **Chemicals:** pentazocine (MESH:D010423), oxycodone (MESH:D010098), alcohol (MESH:D000438), morphine (MESH:D009020), codeine (MESH:D003061), fentanyl (MESH:D005283)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12922761/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12922761/full.md

## References

74 references — full list in the complete paper: https://tomesphere.com/paper/PMC12922761/full.md

---
Source: https://tomesphere.com/paper/PMC12922761