# Using Machine Learning to Improve Control for Confounding in the Dynamic Weighted Ordinary Least Squares Estimator of Optimal Adaptive Treatment Strategies

**Authors:** Kossi Clément Trenou, Miceline Mésidor, Aida Eslami, Hermann Nabi, Caroline Diorio, Denis Talbot

PMC · DOI: 10.1002/bimj.70068 · 2025-07-29

## TL;DR

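This paper evaluates estimating the treatment probability with machine learning (SuperLearner) instead of logistic regression within the dynamic weighted ordinary least squares (dWOLS) estimator of optimal adaptive treatment strategies. In simulations, SuperLearner performed at least as well as logistic regression in simple scenarios and often better in more complex ones.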

## Contribution

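The paper investigates machine learning for the treatment model within the dWOLS estimator, a combination the authors note had not previously been studied. It compares this approach with logistic regression in simulation studies, examines an adaptive m-out-of-n bootstrap for inference, and provides an R implementation on GitHub.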

## Key findings

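- SuperLearner performed at least as well as logistic regression in terms of bias and variance in scenarios with simple data-generating models, and often performed better in more complex scenarios.
- The adaptive m-out-of-n bootstrap produced confidence intervals with nominal coverage probabilities for parameters that were estimated with low bias.
- The approach was applied to data from a breast cancer registry in Québec, Canada, to estimate an optimal adaptive treatment strategy for personalizing the use of hormonal therapy.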
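
The m-out-of-n bootstrap mentioned above can be sketched in a few lines. This is a minimal illustration in Python, not the authors' R implementation: the function name `m_out_of_n_ci`, the toy statistic, and the fixed resample size `m = n ** 0.8` are all assumptions made for this example. The paper uses an adaptive choice of m, which is not reproduced here.

```python
import math
import random


def m_out_of_n_ci(data, statistic, m, n_boot=2000, level=0.95, rng=random):
    """Basic (rescaled) m-out-of-n bootstrap confidence interval.

    Resamples of size m < n are drawn with replacement. The distribution of
    sqrt(m) * (theta*_m - theta_hat) is used to approximate that of
    sqrt(n) * (theta_hat - theta).
    """
    n = len(data)
    theta_hat = statistic(data)
    roots = sorted(
        math.sqrt(m) * (statistic(rng.choices(data, k=m)) - theta_hat)
        for _ in range(n_boot)
    )
    alpha = 1.0 - level
    lo_q = roots[int(math.floor(alpha / 2 * (n_boot - 1)))]
    hi_q = roots[int(math.ceil((1 - alpha / 2) * (n_boot - 1)))]
    # Invert the root and rescale back to the full sample size n.
    return theta_hat - hi_q / math.sqrt(n), theta_hat - lo_q / math.sqrt(n)


# Toy data and a non-smooth statistic (hypothetical, for illustration only).
rng = random.Random(1)
n = 400
data = [rng.gauss(0.3, 1.0) for _ in range(n)]


def positive_part_of_mean(sample):
    return max(sum(sample) / len(sample), 0.0)


m = int(n ** 0.8)  # assumed fixed resample size; the paper chooses m adaptively
lo, hi = m_out_of_n_ci(data, positive_part_of_mean, m, rng=rng)
print("m =", m, "interval =", (lo, hi))
```

Drawing fewer than n observations per resample is what distinguishes this from the standard bootstrap. Smaller resamples are typically motivated by non-smooth parameters, for which the usual n-out-of-n bootstrap can fail.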

## Abstract

Estimating optimal adaptive treatment strategies (ATSs) can be done in several ways, including dynamic weighted ordinary least squares (dWOLS). This approach is doubly robust as it requires modeling both the treatment and the response, but only one of those models needs to be correctly specified to obtain a consistent estimator. For estimating an average treatment effect, doubly robust methods have been shown to combine better with machine learning methods than alternatives. However, the use of machine learning within dWOLS has not yet been investigated. Using simulation studies, we evaluate and compare the performance of the dWOLS estimator when the treatment probability is estimated either using machine learning algorithms or a logistic regression model. We further investigate the use of an adaptive m‐out‐of‐n bootstrap method for producing inferences. SuperLearner performed at least as well as logistic regression in terms of bias and variance in scenarios with simple data‐generating models and often had improved performance in more complex scenarios. Moreover, the m‐out‐of‐n bootstrap produced confidence intervals with nominal coverage probabilities for parameters that were estimated with low bias. We also apply our proposed approach to the data from a breast cancer registry in Québec, Canada, to estimate an optimal ATS to personalize the use of hormonal therapy in breast cancer patients. Our method is implemented in the R software and available on GitHub https://github.com/kosstre20/MachineLearningToControlConfoundingPersonalizedMedicine.git. We recommend routine use of machine learning to model treatment within dWOLS, at least as a sensitivity analysis for the point estimates.
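
To make the role of the treatment model concrete, here is a minimal single-stage dWOLS sketch in Python. It is an illustration under stated assumptions, not the authors' R code: the simulated data, the helper names (`solve`, `weighted_ols`, `fit_logistic`, `recommend`), and the parameter values are hypothetical. The treatment probability is fitted with a plain logistic regression, and any other probability estimator (for example, an ensemble learner such as SuperLearner) could be substituted at that step. The weights |A − π̂(X)| are one commonly used dWOLS weight function. The treatment-free part of the outcome model is deliberately misspecified so that the estimate relies on the treatment model being correct.

```python
import math
import random


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def weighted_ols(rows, y, w):
    """Return the coefficients minimizing sum_i w_i * (y_i - rows_i . beta)^2."""
    p = len(rows[0])
    xtwx = [[sum(wi * r[j] * r[k] for r, wi in zip(rows, w)) for k in range(p)]
            for j in range(p)]
    xtwy = [sum(wi * r[j] * yi for r, yi, wi in zip(rows, y, w)) for j in range(p)]
    return solve(xtwx, xtwy)


def expit(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(rows, a, n_iter=25):
    """Logistic regression fitted by iteratively reweighted least squares."""
    beta = [0.0] * len(rows[0])
    for _ in range(n_iter):
        eta = [sum(bj * rj for bj, rj in zip(beta, r)) for r in rows]
        p = [expit(e) for e in eta]
        w = [pi * (1.0 - pi) for pi in p]
        z = [e + (ai - pi) / wi for e, ai, pi, wi in zip(eta, a, p, w)]
        beta = weighted_ols(rows, z, w)
    return beta


# Simulated data (assumed): one covariate, binary treatment, larger Y is better.
random.seed(2025)
n = 5000
true_psi = (1.0, -2.0)  # blip: A * (psi0 + psi1 * X)
x = [random.gauss(0.0, 1.0) for _ in range(n)]
a = [1 if random.random() < expit(0.5 * xi) else 0 for xi in x]
y = [xi ** 2 + ai * (true_psi[0] + true_psi[1] * xi) + random.gauss(0.0, 1.0)
     for xi, ai in zip(x, a)]

# Step 1: treatment model (swap in a machine learning estimator here).
gamma = fit_logistic([[1.0, xi] for xi in x], a)
pi_hat = [expit(gamma[0] + gamma[1] * xi) for xi in x]

# Step 2: dWOLS weights |A - pi_hat(X)|.
w = [abs(ai - pi) for ai, pi in zip(a, pi_hat)]

# Step 3: weighted regression; the treatment-free part (1, X) is misspecified
# on purpose, since the true treatment-free term is X squared.
design = [[1.0, xi, float(ai), ai * xi] for xi, ai in zip(x, a)]
coef = weighted_ols(design, y, w)
psi_hat = coef[2:]


def recommend(x_new):
    """Treat when the estimated blip psi0 + psi1 * x is positive."""
    return 1 if psi_hat[0] + psi_hat[1] * x_new > 0 else 0


print("estimated blip parameters:", psi_hat)
```

In this sketch the estimated rule is `recommend(x)`, which assigns treatment whenever the estimated blip is positive. A multi-stage strategy would repeat the same three steps backward in time on a pseudo-outcome, which is omitted here.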

## Linked entities

- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Diseases:** breast cancer (MESH:D001943)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12305482/full.md

---
Source: https://tomesphere.com/paper/PMC12305482