A Contextual-bandit-based Approach for Informed Decision-making in   Clinical Trials

Yogatheesan Varatharajah; Brent Berry; Sanmi Koyejo; and Ravishankar; Iyer

arXiv:1809.00258·cs.AI·September 10, 2018·1 cites

A Contextual-bandit-based Approach for Informed Decision-making in Clinical Trials

Yogatheesan Varatharajah, Brent Berry, Sanmi Koyejo, and Ravishankar, Iyer

PDF

Open Access

TL;DR

This paper introduces a contextual-bandit algorithm for clinical trial treatment assignment that considers patient characteristics, outperforming traditional methods and previous bandit approaches in optimizing individual patient outcomes.

Contribution

The paper presents a novel contextual-bandit-based method for adaptive treatment assignment in clinical trials, accounting for patient variability and improving outcome optimization.

Findings

01

The proposed method outperforms random treatment assignment.

02

It achieves 72.63% gains over random assignment.

03

It surpasses previous multi-arm bandit approaches with 64.34% gains.

Abstract

Clinical trials involving multiple treatments utilize randomization of the treatment assignments to enable the evaluation of treatment efficacies in an unbiased manner. Such evaluation is performed in post hoc studies that usually use supervised-learning methods that rely on large amounts of data collected in a randomized fashion. That approach often proves to be suboptimal in that some participants may suffer and even die as a result of having not received the most appropriate treatments during the trial. Reinforcement-learning methods improve the situation by making it possible to learn the treatment efficacies dynamically during the course of the trial, and to adapt treatment assignments accordingly. Recent efforts using \textit{multi-arm bandits}, a type of reinforcement-learning methods, have focused on maximizing clinical outcomes for a population that was assumed to be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Causal Inference Techniques · Advanced Bandit Algorithms Research · Health Systems, Economic Evaluations, Quality of Life