Handling and Interpreting Missing Modalities in Patient Clinical Trajectories via Autoregressive Sequence Modeling
Andrew Wang, Ellie Pavlick, Ritambhara Singh

TL;DR
This paper introduces a sequence modeling approach using causal decoders from large language models to handle missing modalities in multimodal healthcare data, improving interpretability and performance.
Contribution
It proposes a missingness-aware contrastive pre-training method and autoregressive transformer models for better handling missing data in clinical trajectories.
Findings
Outperforms baselines on MIMIC-IV and eICU benchmarks.
Contrastive pre-training mitigates divergent behavior caused by missing modalities.
Provides interpretability techniques to understand modality removal effects.
Abstract
An active challenge in developing multimodal machine learning (ML) models for healthcare is handling missing modalities during training and deployment. As clinical datasets are inherently temporal and sparse in terms of modality presence, capturing the underlying predictive signal via diagnostic multimodal ML models while retaining model explainability remains an ongoing challenge. In this work, we address this by re-framing clinical diagnosis as an autoregressive sequence modeling task, utilizing causal decoders from large language models (LLMs) to model a patient's multimodal trajectory. We first introduce a missingness-aware contrastive pre-training objective that integrates multiple modalities in datasets with missingness in a shared latent space. We then show that autoregressive sequence modeling with transformer-based architectures outperforms baselines on the MIMIC-IV and eICU…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
