# A multimodal embedding model for sepsis data representation

**Authors:** Tuo Liu, Yonglin Li, Hongyi Chen, Naiqing Li, Yan Zhang, Xuanqi Huang, Jin Wang, Rui Chen, Yuping Zeng, Yuntao Liu, Danwen Zheng, Darong Wu, Changdong Wang, Tao Yu, Xiaotu Xi, Zhongde Zhang

PMC · DOI: 10.1038/s41746-026-02446-3 · NPJ Digital Medicine · 2026-02-23

## TL;DR

This paper introduces SepsisDRM, a new model that combines tabular and text data to better understand and predict outcomes in sepsis patients.

## Contribution

SepsisDRM is the first embedding model specifically designed for sepsis, integrating tabular and textual data for improved generalization.

## Key findings

- SepsisDRM effectively stratifies patients into four clinically interpretable phenotypes.
- The model achieves strong AUC scores for 28-day outcome prediction across different datasets.
- It generalizes well without task-specific tuning, showing robust performance in diverse sepsis-related tasks.

## Abstract

Sepsis research has long been constrained by limited labeled data and models designed for specific tasks that primarily rely on tabular inputs, overlooking the valuable insights contained in clinical text. To address these limitations, we propose the Sepsis Data Representation Model (SepsisDRM), an embedding model that jointly processes tabular and textual data to capture comprehensive patient representations. Trained on a dataset comprising 19,526 sepsis patients, SepsisDRM demonstrates strong generalization across diverse sepsis-related tasks without task-specific tuning. It effectively stratifies patients into four clinically interpretable phenotypes and achieves robust performance in predicting 28-day outcomes, with AUC scores of 0.92, 0.94, and 0.78 on retrospective, prospective, and external datasets, respectively. As the first embedding model developed specifically for sepsis, SepsisDRM establishes a novel paradigm for sepsis research and offers a promising approach for studies in other fields that involve the integration of both tabular and textual data.

## Full-text entities

- **Diseases:** Sepsis (MESH:D018805)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13039152/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13039152/full.md

## References

21 references — full list in the complete paper: https://tomesphere.com/paper/PMC13039152/full.md

---
Source: https://tomesphere.com/paper/PMC13039152