Stable CDE Autoencoders with Acuity Regularization for Offline Reinforcement Learning in Sepsis Treatment
Yue Gao

TL;DR
This paper demonstrates that stable Controlled Differential Equation (CDE) autoencoders with acuity regularization improve the quality of state representations for offline reinforcement learning in sepsis treatment, leading to better policy performance and meaningful clinical insights.
Contribution
It introduces a training stabilization approach for CDE autoencoders and enforces acuity-aware regularization, advancing representation learning for clinical RL tasks.
Findings
Stable CDE autoencoders produce representations correlated with clinical scores.
Stable training leads to RL policies with high WIS return (> 0.9).
Unstable CDEs result in poor representations and policy failure.
Abstract
Effective reinforcement learning (RL) for sepsis treatment depends on learning stable, clinically meaningful state representations from irregular ICU time series. While previous works have explored representation learning for this task, the critical challenge of training instability in sequential representations and its detrimental impact on policy performance has been overlooked. This work demonstrates that Controlled Differential Equations (CDE) state representation can achieve strong RL policies when two key factors are met: (1) ensuring training stability through early stopping or stabilization methods, and (2) enforcing acuity-aware representations by correlation regularization with clinical scores (SOFA, SAPS-II, OASIS). Experiments on the MIMIC-III sepsis cohort reveal that stable CDE autoencoder produces representations strongly correlated with acuity scores and enables RL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSepsis Diagnosis and Treatment · ECG Monitoring and Analysis · Non-Invasive Vital Sign Monitoring
