Optimization of anemia treatment in hemodialysis patients via   reinforcement learning

Pablo Escandell-Montero; Milena Chermisi; Jos\'e M.; Mart\'inez-Mart\'inez; Juan G\'omez-Sanchis; Carlo Barbieri; Emilio; Soria-Olivas; Flavio Mari; Joan Vila-Franc\'es; Andrea Stopper; Emanuele; Gatti; Jos\'e D. Mart\'in-Guerrero

arXiv:1509.03977·stat.ML·September 15, 2015·Artif. Intell. Medicine

Optimization of anemia treatment in hemodialysis patients via reinforcement learning

Pablo Escandell-Montero, Milena Chermisi, Jos\'e M., Mart\'inez-Mart\'inez, Juan G\'omez-Sanchis, Carlo Barbieri, Emilio, Soria-Olivas, Flavio Mari, Joan Vila-Franc\'es, Andrea Stopper, Emanuele, Gatti, Jos\'e D. Mart\'in-Guerrero

PDF

TL;DR

This paper presents a reinforcement learning approach to optimize anemia treatment in hemodialysis patients, aiming to improve hemoglobin stability over standard protocols by accounting for patient variability.

Contribution

It introduces a fitted Q iteration-based RL methodology for ESA dosing optimization, outperforming traditional Q-learning and standard protocols in simulations.

Findings

01

FQI outperforms Q-learning and standard protocols in simulations.

02

RL-based method shows potential for personalized anemia treatment.

03

Simulation results indicate improved hemoglobin level stability.

Abstract

Objective: Anemia is a frequent comorbidity in hemodialysis patients that can be successfully treated by administering erythropoiesis-stimulating agents (ESAs). ESAs dosing is currently based on clinical protocols that often do not account for the high inter- and intra-individual variability in the patient's response. As a result, the hemoglobin level of some patients oscillates around the target range, which is associated with multiple risks and side-effects. This work proposes a methodology based on reinforcement learning (RL) to optimize ESA therapy. Methods: RL is a data-driven approach for solving sequential decision-making problems that are formulated as Markov decision processes (MDPs). Computing optimal drug administration strategies for chronic diseases is a sequential decision-making problem in which the goal is to find the best sequence of drug doses. MDPs are particularly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsQ-Learning