Attend and Diagnose: Clinical Time Series Analysis using Attention Models
Huan Song, Deepta Rajan, Jayaraman J. Thiagarajan, Andreas Spanias

TL;DR
This paper introduces SAnD, a novel attention-based neural network architecture that models clinical time-series data without recurrence, achieving state-of-the-art results on MIMIC-III benchmarks and improving efficiency over traditional RNNs.
Contribution
The paper pioneers the use of attention mechanisms for clinical time-series analysis, replacing RNNs and demonstrating superior performance and efficiency.
Findings
SAnD outperforms LSTM models on all clinical prediction tasks.
The attention-based model achieves state-of-the-art results on MIMIC-III datasets.
The approach is more computationally efficient than recurrent architectures.
Abstract
With widespread adoption of electronic health records, there is an increased emphasis for predictive models that can effectively deal with clinical time-series data. Powered by Recurrent Neural Network (RNN) architectures with Long Short-Term Memory (LSTM) units, deep neural networks have achieved state-of-the-art results in several clinical prediction tasks. Despite the success of RNNs, its sequential nature prohibits parallelized computing, thus making it inefficient particularly when processing long sequences. Recently, architectures which are based solely on attention mechanisms have shown remarkable success in transduction tasks in NLP, while being computationally superior. In this paper, for the first time, we utilize attention models for clinical time-series modeling, thereby dispensing recurrence entirely. We develop the \textit{SAnD} (Simply Attend and Diagnose) architecture,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Topic Modeling · Time Series Analysis and Forecasting
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
