# Uncovering distinct clinical phenotypes in disseminated intravascular coagulation through machine learning-enabled cluster analysis

**Authors:** Qingbo Zeng, Junjie Zeng, Qingwei Lin, Lincui Zhong, Longping He, Jingchun Song

PMC · DOI: 10.3389/fmolb.2026.1699476 · 2026-02-25

## TL;DR

This study uses machine learning to identify two distinct subtypes of disseminated intravascular coagulation (DIC) with different severity and outcomes.

## Contribution

The novel use of unsupervised machine learning to stratify DIC patients into clinically meaningful subtypes.

## Key findings

- Two distinct DIC subtypes were identified: mild and severe coagulation dysfunction.
- The severe subtype was associated with significantly higher 7-day and 28-day mortality risks.
- Subtypes showed differences in both model variables and other clinical parameters like heart rate and blood pressure.

## Abstract

Disseminated intravascular coagulation (DIC) is a critical condition encountered in the intensive care unit (ICU), characterized by multiple etiologies and variable outcomes. Distinguishing between DIC phenotypes poses a significant challenge. This study aims to apply unsupervised machine learning (ML) algorithms to stratify DIC patients, thereby enabling more personalized treatment approaches.

We conducted a retrospective analysis of patients diagnosed with DIC upon admission to the ICU at a comprehensive teaching tertiary hospital in China, spanning from May 2015 to November 2022. We applied an unsupervised machine learning approach for consensus clustering using the R package Consensus Cluster Plus to identify clinical phenotypes in 134 patients with DIC. The analysis incorporated the key variables: Thrombin-Antithrombin Complex (TAT), Plasmin-α2-Plasmin Inhibitor Complex (PIC), tissue plasminogen activator-inhibitor complex (tPAIC), and thrombomodulin (TM). The elbow method, cumulative distribution function (CDF) plot, and consensus matrix were employed to ascertain the optimal number of clusters. Logistic regression (LR) analysis was used to investigate the association between the identified phenotypes and clinical endpoints.

The consensus cluster analysis delineated two distinct subtypes: a mild coagulation dysfunction subtype (n = 79) and a severe coagulation dysfunction subtype (n = 55). Notable differences were observed in both variables included in the analysis (e.g., thrombin-antithrombin complex [TAT], P < 0.05) and those not utilized for model training (e.g., heart rate [HR] P < 0.05 and systolic blood pressure [SBP] P < 0.05). Logistic regression revealed that the severe coagulation dysfunction subtype was significantly associated with increased odds of 7-day (OR 4.71; 95% CI 2.23–9.98; P < 0.001), 28-day (OR 2.29; 95% CI 1.11–4.72; P = 0.024).

The study identified two clusters with distinct laboratory profiles and mortality risk.

## Linked entities

- **Diseases:** disseminated intravascular coagulation (MONDO:0001243), DIC (MONDO:0001243)

## Full-text entities

- **Genes:** PLG (plasminogen) [NCBI Gene 5340] {aka HAE4}, THBD (thrombomodulin) [NCBI Gene 7056] {aka AHUS6, BDCA-3, BDCA3, CD141, THPH12, THRM}, GPHA2 (glycoprotein hormone subunit alpha 2) [NCBI Gene 170589] {aka A2, GPA2, ZSIG51}, F2 (coagulation factor II, thrombin) [NCBI Gene 2147] {aka PT, RPRGL2, THPH1}, SERPINC1 (serpin family C member 1) [NCBI Gene 462] {aka AT3, AT3D, ATIII, ATIII-R2, ATIII-T1, ATIII-T2}
- **Diseases:** DIC (MESH:D004211), coagulation dysfunction (MESH:D001778)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12975424/full.md

---
Source: https://tomesphere.com/paper/PMC12975424