# Subspace Clustering of Physiological Data From Acute Traumatic Brain Injury Patients: Retrospective Analysis Based on the PROTECT III Trial

**Authors:** Sina Ehsani, Chandan K Reddy, Brandon Foreman, Jonathan Ratcliff, Vignesh Subbian

PMC · DOI: 10.2196/24698 · JMIR Biomedical Engineering · 2021-02-02

## TL;DR

This study uses machine learning to find patterns in data from traumatic brain injury patients, helping to better understand and classify their conditions.

## Contribution

The study demonstrates the feasibility of subspace clustering for identifying physiological patterns in TBI patients.

## Key findings

- Three clusters of laboratory physiological data were identified, including INR, chloride, creatinine, hemoglobin, and hematocrit.
- Density-based clustering algorithms showed higher accuracy in classifying patients by mortality status.
- Clustering approaches can aid in defining and validating phenotypes in TBI research.

## Abstract

With advances in digital health technologies and proliferation of biomedical data in recent years, applications of machine learning in health care and medicine have gained considerable attention. While inpatient settings are equipped to generate rich clinical data from patients, there is a dearth of actionable information that can be used for pursuing secondary research for specific clinical conditions.

This study focused on applying unsupervised machine learning techniques for traumatic brain injury (TBI), which is the leading cause of death and disability among children and adults aged less than 44 years. Specifically, we present a case study to demonstrate the feasibility and applicability of subspace clustering techniques for extracting patterns from data collected from TBI patients.

Data for this study were obtained from the Progesterone for Traumatic Brain Injury, Experimental Clinical Treatment–Phase III (PROTECT III) trial, which included a cohort of 882 TBI patients. We applied subspace-clustering methods (density-based, cell-based, and clustering-oriented methods) to this data set and compared the performance of the different clustering methods.

The analyses showed the following three clusters of laboratory physiological data: (1) international normalized ratio (INR), (2) INR, chloride, and creatinine, and (3) hemoglobin and hematocrit. While all subclustering algorithms had a reasonable accuracy in classifying patients by mortality status, the density-based algorithm had a higher F1 score and coverage.

Clustering approaches serve as an important step for phenotype definition and validation in clinical domains such as TBI, where patient and injury heterogeneity are among the major reasons for failure of clinical trials. The results from this study provide a foundation to develop scalable clustering algorithms for further research and validation.

## Linked entities

- **Diseases:** traumatic brain injury (MONDO:0858950)

## Full-text entities

- **Diseases:** TBI (MESH:D000070642), death and disability (MESH:D003643)
- **Chemicals:** Progesterone (MESH:D011374), creatinine (MESH:D003404), chloride (MESH:D002712)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11041422/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11041422/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC11041422/full.md

---
Source: https://tomesphere.com/paper/PMC11041422