A Contextual-bandit-based Approach for Informed Decision-making in Clinical Trials
Yogatheesan Varatharajah, Brent Berry, Sanmi Koyejo, and Ravishankar, Iyer

TL;DR
This paper introduces a contextual-bandit algorithm for clinical trial treatment assignment that considers patient characteristics, outperforming traditional methods and previous bandit approaches in optimizing individual patient outcomes.
Contribution
The paper presents a novel contextual-bandit-based method for adaptive treatment assignment in clinical trials, accounting for patient variability and improving outcome optimization.
Findings
The proposed method outperforms random treatment assignment.
It achieves 72.63% gains over random assignment.
It surpasses previous multi-arm bandit approaches with 64.34% gains.
Abstract
Clinical trials involving multiple treatments utilize randomization of the treatment assignments to enable the evaluation of treatment efficacies in an unbiased manner. Such evaluation is performed in post hoc studies that usually use supervised-learning methods that rely on large amounts of data collected in a randomized fashion. That approach often proves to be suboptimal in that some participants may suffer and even die as a result of having not received the most appropriate treatments during the trial. Reinforcement-learning methods improve the situation by making it possible to learn the treatment efficacies dynamically during the course of the trial, and to adapt treatment assignments accordingly. Recent efforts using \textit{multi-arm bandits}, a type of reinforcement-learning methods, have focused on maximizing clinical outcomes for a population that was assumed to be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Advanced Bandit Algorithms Research · Health Systems, Economic Evaluations, Quality of Life
