Feature Robustness in Non-stationary Health Records: Caveats to Deployable Model Performance in Common Clinical Machine Learning Tasks
Bret Nestor, Matthew B. A. McDermott, Willie Boag, Gabriela Berner,, Tristan Naumann, Michael C. Hughes, Anna Goldenberg, Marzyeh Ghassemi

TL;DR
This paper highlights the challenge of model performance decay over time in clinical EHR prediction tasks and proposes feature aggregation into clinical concepts as an effective mitigation strategy to improve robustness against data drift.
Contribution
The study reveals the impact of temporal data shifts on model accuracy and introduces a simple feature aggregation method that enhances model robustness in non-stationary clinical data.
Findings
Models trained on historical data decay in performance when tested on future data.
Aggregating features into clinical concepts reduces performance decay significantly.
The proposed aggregation method outperforms automatic feature preprocessing techniques.
Abstract
When training clinical prediction models from electronic health records (EHRs), a key concern should be a model's ability to sustain performance over time when deployed, even as care practices, database systems, and population demographics evolve. Due to de-identification requirements, however, current experimental practices for public EHR benchmarks (such as the MIMIC-III critical care dataset) are time agnostic, assigning care records to train or test sets without regard for the actual dates of care. As a result, current benchmarks cannot assess how well models trained on one year generalise to another. In this work, we obtain a Limited Data Use Agreement to access year of care for each record in MIMIC and show that all tested state-of-the-art models decay in prediction quality when trained on historical data and tested on future data, particularly in response to a system-wide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare · Time Series Analysis and Forecasting
