Deep Offline Reinforcement Learning for Real-world Treatment   Optimization Applications

Milashini Nambiar; Supriyo Ghosh; Priscilla Ong; Yu En Chan; and Yong Mong Bee; Pavitra Krishnaswamy

arXiv:2302.07549·cs.LG·June 14, 2023·1 cites

Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications

Milashini Nambiar, Supriyo Ghosh, Priscilla Ong, Yu En Chan, and Yong Mong Bee, Pavitra Krishnaswamy

PDF

Open Access

TL;DR

This paper introduces a novel offline reinforcement learning method tailored for real-world medical treatment optimization, addressing challenges like action imbalance and safety constraints, and demonstrates its effectiveness on diabetes and sepsis datasets.

Contribution

The study proposes a practical transition sampling approach for offline RL that improves treatment decision quality in safety-critical healthcare applications.

Findings

01

Significant improvement in expected health outcomes.

02

Outperforms baseline methods like DDQN and CQL.

03

Aligns with clinical safety and practice guidelines.

Abstract

There is increasing interest in data-driven approaches for recommending optimal treatment strategies in many chronic disease management and critical care applications. Reinforcement learning methods are well-suited to this sequential decision-making problem, but must be trained and evaluated exclusively on retrospective medical record datasets as direct online exploration is unsafe and infeasible. Despite this requirement, the vast majority of treatment optimization studies use off-policy RL methods (e.g., Double Deep Q Networks (DDQN) or its variants) that are known to perform poorly in purely offline settings. Recent advances in offline RL, such as Conservative Q-Learning (CQL), offer a suitable alternative. But there remain challenges in adapting these approaches to real-world applications where suboptimal examples dominate the retrospective dataset and strict safety constraints need…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Frailty in Older Adults · Sepsis Diagnosis and Treatment

MethodsQ-Learning