Learning to Adapt Clinical Sequences with Residual Mixture of Experts
Jeong Min Lee, Milos Hauskrecht

TL;DR
This paper introduces a residual Mixture-of-Experts architecture using multiple RNNs to better model the heterogeneity in clinical event sequences from EHRs, improving prediction accuracy over single models.
Contribution
It proposes a novel residual MoE approach that refines a pretrained base RNN with multiple expert RNNs to adapt to patient sub-populations.
Findings
Achieved 4.1% gain in AUPRC over single RNN models
Effectively models patient heterogeneity in clinical sequences
Demonstrates improved predictive performance on real-world EHR data
Abstract
Clinical event sequences in Electronic Health Records (EHRs) record detailed information about the patient condition and patient care as they occur in time. Recent years have witnessed increased interest of machine learning community in developing machine learning models solving different types of problems defined upon information in EHRs. More recently, neural sequential models, such as RNN and LSTM, became popular and widely applied models for representing patient sequence data and for predicting future events or outcomes based on such data. However, a single neural sequential model may not properly represent complex dynamics of all patients and the differences in their behaviors. In this work, we aim to alleviate this limitation by refining a one-fits-all model using a Mixture-of-Experts (MoE) architecture. The architecture consists of multiple (expert) RNN models covering patient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Electronic Health Records Systems · Artificial Intelligence in Healthcare
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Balanced Selection · Gated Recurrent Unit
