Controlling Level of Unconsciousness by Titrating Propofol with Deep   Reinforcement Learning

Gabe Schamberg; Marcus Badgeley; and Emery N. Brown

arXiv:2008.12333·cs.LG·September 10, 2020

Controlling Level of Unconsciousness by Titrating Propofol with Deep Reinforcement Learning

Gabe Schamberg, Marcus Badgeley, and Emery N. Brown

PDF

TL;DR

This paper demonstrates the first application of deep reinforcement learning for closed-loop control of anesthetic dosing, achieving superior performance over traditional controllers in simulated environments.

Contribution

It introduces a deep RL approach for titrating propofol to control unconsciousness, utilizing continuous input modeling for robustness and interpretability.

Findings

01

Deep RL outperforms PID controllers in simulation.

02

Modeling continuous variables improves robustness.

03

Deep RL policies are smooth and interpretable.

Abstract

Reinforcement Learning (RL) can be used to fit a mapping from patient state to a medication regimen. Prior studies have used deterministic and value-based tabular learning to learn a propofol dose from an observed anesthetic state. Deep RL replaces the table with a deep neural network and has been used to learn medication regimens from registry databases. Here we perform the first application of deep RL to closed-loop control of anesthetic dosing in a simulated environment. We use the cross-entropy method to train a deep neural network to map an observed anesthetic state to a probability of infusing a fixed propofol dosage. During testing, we implement a deterministic policy that transforms the probability of infusion to a continuous infusion rate. The model is trained and tested on simulated pharmacokinetic/pharmacodynamic models with randomized parameters to ensure robustness to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.