Robust Real-Time Mortality Prediction in the Intensive Care Unit using Temporal Difference Learning
Thomas Frost, Kezhi Li, Steve Harris

TL;DR
This paper introduces a TD learning framework for real-time ICU mortality prediction, demonstrating improved robustness over traditional methods, especially with high-variance irregular time series data.
Contribution
It develops a novel framework applying TD learning to healthcare data, addressing variance issues and enhancing prediction robustness in ICU mortality models.
Findings
TD learning improves model robustness in mortality prediction
The approach maintains performance on external datasets
Outperforms traditional supervised learning methods in irregular data scenarios
Abstract
The task of predicting long-term patient outcomes using supervised machine learning is a challenging one, in part because of the high variance of each patient's trajectory, which can result in the model over-fitting to the training data. Temporal difference (TD) learning, a common reinforcement learning technique, may reduce variance by generalising learning to the pattern of state transitions rather than terminal outcomes. However, in healthcare this method requires several strong assumptions about patient states, and there appears to be limited literature evaluating the performance of TD learning against traditional supervised learning methods for long-term health outcome prediction tasks. In this study, we define a framework for applying TD learning to real-time irregularly sampled time series data using a Semi-Markov Reward Process. We evaluate the model framework in predicting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare
