Near-Equivalent Q-learning Policies for Dynamic Treatment Regimes
Sophia Yazzourh, Erica E.M. Moodie

TL;DR
This paper extends Q-learning for dynamic treatment regimes by allowing multiple near-optimal policies within a controlled performance margin, revealing treatment indifference regions and providing more flexible decision options in precision medicine.
Contribution
It introduces an $oldsymbol{ extit{ extbf{ε}}}$-tolerance criterion to generate sets of near-optimal policies, shifting from a single policy to multiple admissible strategies in dynamic treatment decision-making.
Findings
Identifies regions of treatment indifference in decision boundaries.
Constructs sets of near-optimal policies with controlled deviation.
Demonstrates the approach in oncology treatment simulations.
Abstract
Precision medicine aims to tailor therapeutic decisions to individual patient characteristics. This objective is commonly formalized through dynamic treatment regimes, which use statistical and machine learning methods to derive sequential decision rules adapted to evolving clinical information. In most existing formulations, these approaches produce a single optimal treatment at each stage, leading to a unique decision sequence. However, in many clinical settings, several treatment options may yield similar expected outcomes, and focusing on a single optimal policy may conceal meaningful alternatives. We extend the Q-learning framework for retrospective data by introducing a worst-value tolerance criterion controlled by a hyperparameter , which specifies the maximum acceptable deviation from the optimal expected value. Rather than identifying a single optimal policy, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Mathematical Biology Tumor Growth · Statistical Methods in Clinical Trials
