Learning to Diagnose with LSTM Recurrent Neural Networks
Zachary C. Lipton, David C. Kale, Charles Elkan, Randall Wetzel

TL;DR
This paper demonstrates that LSTM recurrent neural networks can effectively classify multiple diagnoses from irregular, multivariate clinical time series data, outperforming traditional methods.
Contribution
It is the first empirical evaluation of LSTMs on clinical time series data, showing their superiority over baseline models and proposing a simple training strategy.
Findings
LSTM models outperform baseline methods in diagnosis classification.
Replicating targets at each sequence step improves training effectiveness.
LSTMs effectively handle irregularly sampled clinical data.
Abstract
Clinical medical data, especially in the intensive care unit (ICU), consist of multivariate time series of observations. For each patient visit (or episode), sensor data and lab test results are recorded in the patient's Electronic Health Record (EHR). While potentially containing a wealth of insights, the data is difficult to mine effectively, owing to varying length, irregular sampling and missing data. Recurrent Neural Networks (RNNs), particularly those using Long Short-Term Memory (LSTM) hidden units, are powerful and increasingly popular models for learning from sequence data. They effectively model varying length sequences and capture long range dependencies. We present the first study to empirically evaluate the ability of LSTMs to recognize patterns in multivariate time series of clinical measurements. Specifically, we consider multilabel classification of diagnoses, training a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Time Series Analysis and Forecasting · Topic Modeling
