OMG-RL:Offline Model-based Guided Reward Learning for Heparin Treatment
Yooseok Lim, Sujee Lee

TL;DR
This paper introduces OMG-RL, an offline model-based inverse reinforcement learning method that learns clinician-like reward functions from limited data to optimize heparin dosing, improving treatment policies.
Contribution
It develops a novel offline IRL approach to capture expert intentions without explicit rewards, applicable to medication dosing tasks.
Findings
OMG-RL improves policy alignment with clinician intentions
The learned reward correlates with key clinical indicators
Demonstrates effectiveness in heparin dosing task
Abstract
Accurate medication dosing holds an important position in the overall patient therapeutic process. Therefore, much research has been conducted to develop optimal administration strategy based on Reinforcement learning (RL). However, Relying solely on a few explicitly defined reward functions makes it difficult to learn a treatment strategy that encompasses the diverse characteristics of various patients. Moreover, the multitude of drugs utilized in clinical practice makes it infeasible to construct a dedicated reward function for each medication. Here, we tried to develop a reward network that captures clinicians' therapeutic intentions, departing from explicit rewards, and to derive an optimal heparin dosing policy. In this study, we introduce Offline Model-based Guided Reward Learning (OMG-RL), which performs offline inverse RL (IRL). Through OMG-RL, we learn a parameterized reward…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHeparin-Induced Thrombocytopenia and Thrombosis · Venous Thromboembolism Diagnosis and Management
