Exploring Time-Step Size in Reinforcement Learning for Sepsis Treatment
Yingchuan Sun, Shengpu Tang

TL;DR
This study empirically compares different time-step sizes in reinforcement learning for sepsis treatment, revealing that smaller steps generally improve policy performance and stability, challenging the conventional 4-hour standard.
Contribution
The paper systematically evaluates the impact of various time-step sizes on RL performance in sepsis management, introducing action re-mapping for fair comparison across different temporal granularities.
Findings
Finer time-steps ($$h, 2h) yield better policy performance.
Performance trends vary with learning setups and time-step sizes.
Smaller time-steps improve stability and policy quality.
Abstract
Existing studies on reinforcement learning (RL) for sepsis management have mostly followed an established problem setup, in which patient data are aggregated into 4-hour time steps. Although concerns have been raised regarding the coarseness of this time-step size, which might distort patient dynamics and lead to suboptimal treatment policies, the extent to which this is a problem in practice remains unexplored. In this work, we conducted empirical experiments for a controlled comparison of four time-step sizes ( h) on this domain, following an identical offline RL pipeline. To enable a fair comparison across time-step sizes, we designed action re-mapping methods that allow for evaluation of policies on datasets with different time-step sizes, and conducted cross- model selections under two policy learning setups. Our goal was to quantify how time-step…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSepsis Diagnosis and Treatment · Machine Learning in Healthcare · Intensive Care Unit Cognitive Disorders
