# Semiparametric discovery and estimation of interaction in mixed exposures using stochastic interventions

**Authors:** David B. McCoy, Alan Hubbard, Mark van der Laan, Alejandro Schuler

PMC · DOI: 10.1515/jci-2024-0058 · Journal of causal inference · 2026-02-20

## TL;DR

This paper introduces InterXshift, a new method for identifying and estimating interactions among multiple environmental exposures using machine learning and statistical techniques.

## Contribution

The novel semiparametric method InterXshift enables model-independent discovery and estimation of mixed exposure interactions.

## Key findings

- InterXshift accurately identifies true interaction directions in simulations and real-world data.
- The method highlights significant impacts of exposure interactions in environmental health studies.
- InterXshift is applied to NHANES data to assess furan exposure effects on telomere length.

## Abstract

Understanding the complex interactions among multiple environmental exposures is critical for assessing their combined impact on health outcomes. This study introduces InterXshift, a novel semiparametric method that provides a nonparametric definition of interaction and facilitates both the discovery and efficient estimation of interaction effects in mixed exposures. Leveraging stochastic shift interventions and ensemble machine learning, InterXshift identifies and quantifies interactions through a model-independent target parameter, estimated using targeted maximum likelihood estimation (TMLE) and cross-validation. The approach contrasts expected outcomes from joint interventions against those from individual exposures, enabling the detection of synergistic and antagonistic interactions. Validation through simulations and application to the National Institute of Environmental Health Sciences (NIEHS) Mixtures Workshop data demonstrate InterXshift’s efficacy in accurately identifying true interaction directions and consistently highlighting significant impacts. We apply our methodology to National Health and Nutrition Examination Survey (NHANES) data to understand the interaction effect (if any) of furan exposure on leukocyte telomere length. This method enhances the analysis of multi-exposure interactions within high-dimensional datasets, offering robust methodological improvements for elucidating complex exposure dynamics in environmental health research. Additionally, we provide an opensource implementation of InterXshift in the InterXshift R package, facilitating its adoption and application by the research community.

## Linked entities

- **Chemicals:** furan (PubChem CID 8029)

## Full-text entities

- **Genes:** POP1 (POP1 ribonuclease P/MRP subunit) [NCBI Gene 10940] {aka ANXD2}, AHR (aryl hydrocarbon receptor) [NCBI Gene 196] {aka FVH3, RP85, bHLHe76}
- **Diseases:** DGM (MESH:D041781), endocrine disruption (MESH:D004700)
- **Chemicals:** lipid (MESH:D008055), cotinine (MESH:D003367), furan (MESH:C039281), Dioxins (MESH:D004147), 1,2,3,4,6,7,8-hxcdf (-), PCB (MESH:D011078), Furans (MESH:D005663), PCB 126 (MESH:C023035), PCB 118 (MESH:C070055)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12920007/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12920007/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/PMC12920007/full.md

---
Source: https://tomesphere.com/paper/PMC12920007