Estimating and Improving Dynamic Treatment Regimes With a Time-Varying Instrumental Variable
Shuxiao Chen, Bo Zhang

TL;DR
This paper develops a novel framework for estimating and improving dynamic treatment regimes using time-varying instrumental variables, addressing unmeasured confounding and enhancing policy performance in observational studies.
Contribution
It introduces a Bellman equation under partial identification, defines IV-optimal DTRs, and proposes a method for IV-improvement to enhance existing treatment strategies.
Findings
IV-optimal DTRs outperform those under no unmeasured confounding assumptions
IV-improved DTRs can strictly improve upon baseline policies
Simulation studies demonstrate superior performance of proposed methods
Abstract
Estimating dynamic treatment regimes (DTRs) from retrospective observational data is challenging as some degree of unmeasured confounding is often expected. In this work, we develop a framework of estimating properly defined "optimal" DTRs with a time-varying instrumental variable (IV) when unmeasured covariates confound the treatment and outcome, rendering the potential outcome distributions only partially identified. We derive a novel Bellman equation under partial identification, use it to define a generic class of estimands (termed IV-optimal DTRs), and study the associated estimation problem. We then extend the IV-optimality framework to tackle the policy improvement problem, delivering IV-improved DTRs that are guaranteed to perform no worse and potentially better than a pre-specified baseline DTR. Importantly, our IV-improvement framework opens up the possibility of strictly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
