Learning to Adapt Clinical Sequences with Residual Mixture of Experts

Jeong Min Lee; Milos Hauskrecht

arXiv:2204.02687·cs.LG·April 7, 2022

Learning to Adapt Clinical Sequences with Residual Mixture of Experts

Jeong Min Lee, Milos Hauskrecht

PDF

Open Access 1 Repo

TL;DR

This paper introduces a residual Mixture-of-Experts architecture using multiple RNNs to better model the heterogeneity in clinical event sequences from EHRs, improving prediction accuracy over single models.

Contribution

It proposes a novel residual MoE approach that refines a pretrained base RNN with multiple expert RNNs to adapt to patient sub-populations.

Findings

01

Achieved 4.1% gain in AUPRC over single RNN models

02

Effectively models patient heterogeneity in clinical sequences

03

Demonstrates improved predictive performance on real-world EHR data

Abstract

Clinical event sequences in Electronic Health Records (EHRs) record detailed information about the patient condition and patient care as they occur in time. Recent years have witnessed increased interest of machine learning community in developing machine learning models solving different types of problems defined upon information in EHRs. More recently, neural sequential models, such as RNN and LSTM, became popular and widely applied models for representing patient sequence data and for predicting future events or outcomes based on such data. However, a single neural sequential model may not properly represent complex dynamics of all patients and the differences in their behaviors. In this work, we aim to alleviate this limitation by refining a one-fits-all model using a Mixture-of-Experts (MoE) architecture. The architecture consists of multiple (expert) RNN models covering patient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

leej35/residual-moe
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Electronic Health Records Systems · Artificial Intelligence in Healthcare

MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Balanced Selection · Gated Recurrent Unit