Robust Hybrid Learning for Estimating Personalized Dynamic Treatment Regimens
Ying Liu, Yuanjia Wang, Michael R. Kosorok, Yingqi Zhao, Donglin Zeng

TL;DR
This paper introduces a robust hybrid method called AMOL that combines outcome-weighted learning and Q-learning to improve the estimation of personalized dynamic treatment regimens from SMART trial data, accommodating complex, heterogeneous, and multi-stage treatment scenarios.
Contribution
The paper proposes a novel AMOL method that generalizes outcome-weighted learning, reduces variability, and incorporates doubly robust augmentation for better DTR estimation, even with model misspecification.
Findings
AMOL outperforms existing methods in simulations.
AMOL provides more stable and efficient treatment rules.
Application to real SMART data demonstrates practical utility.
Abstract
Dynamic treatment regimens (DTRs) are sequential decision rules tailored at each stage by potentially time-varying patient features and intermediate outcomes observed in previous stages. The complexity, patient heterogeneity and chronicity of many diseases and disorders call for learning optimal DTRs which best dynamically tailor treatment to each individual's response over time. Proliferation of personalized data (e.g., genetic and imaging data) provides opportunities for deep tailoring as well as new challenges for statistical methodology. In this work, we propose a robust hybrid approach referred as Augmented Multistage Outcome-Weighted Learning (AMOL) to integrate outcome-weighted learning and Q-learning to identify optimal DTRs from the Sequential Multiple Assignment Randomization Trials (SMARTs). We generalize outcome weighted learning (O-learning; Zhao et al.~2012) to allow for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsControl Systems and Identification · Statistical Methods and Inference · Advanced Causal Inference Techniques
