Handling and Interpreting Missing Modalities in Patient Clinical Trajectories via Autoregressive Sequence Modeling

Andrew Wang; Ellie Pavlick; Ritambhara Singh

arXiv:2604.18753·cs.LG·May 8, 2026

TL;DR

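This paper reframes clinical diagnosis as an autoregressive sequence modeling task, using causal decoders from large language models to model a patient's multimodal trajectory when some modalities are missing. It reports better performance than baselines on MIMIC-IV and eICU and provides interpretability techniques for studying the effect of removing a modality.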

Contribution

The paper proposes a missingness-aware contrastive pre-training objective that integrates multiple modalities in a shared latent space, together with autoregressive transformer models that handle missing modalities in clinical trajectories.

Findings

01. Outperforms baselines on MIMIC-IV and eICU benchmarks.

02. Contrastive pre-training mitigates divergent behavior caused by missing modalities.

03. Provides interpretability techniques to understand modality removal effects.

Abstract

An active challenge in developing multimodal machine learning (ML) models for healthcare is handling missing modalities during training and deployment. As clinical datasets are inherently temporal and sparse in terms of modality presence, capturing the underlying predictive signal via diagnostic multimodal ML models while retaining model explainability remains an ongoing challenge. In this work, we address this by re-framing clinical diagnosis as an autoregressive sequence modeling task, utilizing causal decoders from large language models (LLMs) to model a patient's multimodal trajectory. We first introduce a missingness-aware contrastive pre-training objective that integrates multiple modalities in datasets with missingness in a shared latent space. We then show that autoregressive sequence modeling with transformer-based architectures outperforms baselines on the MIMIC-IV and eICU…
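
To make the idea of a missingness-aware contrastive objective concrete, the following is a minimal sketch and not the authors' implementation. It assumes a symmetric InfoNCE loss between two modality embeddings, restricted to patients for whom both modalities are observed. The function name `masked_info_nce`, the presence masks, and the temperature value are all hypothetical choices made for illustration; the paper's actual objective may differ.

```python
import numpy as np

def masked_info_nce(z_a, z_b, present_a, present_b, temperature=0.1):
    """Symmetric InfoNCE restricted to rows where both modalities are observed.

    Hypothetical illustration: z_a and z_b are (N, D) embeddings of two
    modalities for the same N patients; present_a and present_b are boolean
    (N,) masks marking which patients actually have each modality.
    """
    both = present_a & present_b
    if both.sum() < 2:
        return 0.0  # fewer than two complete pairs: no negatives to contrast

    # Keep only complete pairs and L2-normalize so dot products are cosines.
    a = z_a[both] / np.linalg.norm(z_a[both], axis=1, keepdims=True)
    b = z_b[both] / np.linalg.norm(z_b[both], axis=1, keepdims=True)
    logits = a @ b.T / temperature

    def diagonal_cross_entropy(l):
        # Numerically stable log-softmax over each row; positives lie on the diagonal.
        l = l - l.max(axis=1, keepdims=True)
        log_prob = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -np.mean(np.diag(log_prob))

    # Average the a-to-b and b-to-a directions.
    return 0.5 * (diagonal_cross_entropy(logits) + diagonal_cross_entropy(logits.T))

# Usage: six patients, one missing modality A and one missing modality B.
z = np.eye(6)
present_a = np.array([True, True, True, True, True, False])
present_b = np.array([True, True, False, True, True, True])
loss = masked_info_nce(z, z, present_a, present_b)
```

In this sketch, embeddings of unobserved modalities never enter the loss, so placeholder values for missing inputs cannot shape the shared latent space. Other designs, such as learned missingness tokens, are equally plausible readings of the abstract.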
