# A holistic framework for intradialytic hypotension prediction using generative adversarial networks-based data balancing

**Authors:** Hsuan-Ming Lin, JrJung Lyu

PMC · DOI: 10.1186/s12911-025-03094-5 · 2025-07-10

## TL;DR

This study introduces a new method using generative adversarial networks to balance data for predicting intradialytic hypotension, improving prediction accuracy over traditional methods.

## Contribution

The novel contribution is an enhanced CWGAN-GP framework for generating synthetic data to address class imbalance in IDH prediction.

## Key findings

- The GAN Balanced dataset achieved the highest predictive performance with statistically significant improvements in PR-AUC and Accuracy.
- SHAP analysis identified key predictors like Dialysis Date and hemodynamic indicators for IDH.
- Traditional methods like SMOTE and ADASYN underperformed compared to the proposed GAN-based approach.

## Abstract

Intradialytic Hypotension (IDH) is a frequent complication in hemodialysis, yet predictive modeling is challenged by class imbalance. Traditional oversampling methods often struggle with complex clinical data. This study evaluates an enhanced conditional Wasserstein Generative Adversarial Network with Gradient Penalty (CWGAN-GP) framework to improve IDH prediction by generating high-utility synthetic data for balancing.

A CWGAN-GP was developed using multi-level hemodialysis data. Following rigorous preprocessing, including a strict temporal train-test split, the CWGAN-GP generated minority class samples exclusively on the training data. eXtreme Gradient Boosting (XGBoost) models were trained on the original imbalanced data and datasets balanced using the proposed CWGAN-GP method, benchmarked against traditional Synthetic Minority Over-sampling Technique(SMOTE) and Adaptive Synthetic Sampling Approach(ADASYN) balancing. Performance was evaluated using metrics sensitive to imbalance (e.g., Precision-Recall Area Under the Curve) and statistical comparisons, with SHapley Additive exPlanations (SHAP) analysis for interpretability.

The study population consisted of 40 chronic hemodialysis patients (45% male, mean age 66.30\documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$\:\pm$$\end{document} 10.68 years). An initial dataset, where intradialytic hypotension (IDH) events occurred in 14.85% of records (19,124 instances overall), was temporally split (75:25 ratio). This yielded an Original Training dataset of 95,856 samples (14.73% IDH rate) and a test set (15.21% IDH rate). From this Original Training dataset, a Generative Adversarial Network (GAN) was employed to construct a balanced dataset comprising 163,470 samples. The GAN Balanced dataset yielded the highest predictive performance, demonstrating statistically significant improvements over the Original Training dataset across metrics, including Precision-Recall Area Under the Curve (PR-AUC) (mean 0.735 vs 0.724) and Accuracy (mean 0.900 vs 0.892). In contrast, the GAN Augmented dataset (191,712 samples) showed mixed results (improved Accuracy/F1, decreased Receiver Operating Characteristic Curve Area Under Curve (ROC-AUC)/PR-AUC). In comparison, ADASYN (163,326 samples) and SMOTE (163,470 samples) balanced datasets significantly underperformed on PR-AUC. SHAP analysis identified Dialysis Date (as a proxy for temporal patterns like day-of-week) and hemodynamic indicators (e.g., Systolic Diastolic Difference, Previous Systolic Pressure) as key IDH predictors.

The proposed CWGAN-GP framework effectively balances complex hemodialysis data, leading to significantly improved and interpretable IDH prediction models compared to standard approaches. This work supports leveraging advanced generative models like GAN to overcome data imbalance in clinical prediction tasks, which is pending further validation.

The online version contains supplementary material available at 10.1186/s12911-025-03094-5.

## Full-text entities

- **Diseases:** IDH (MESH:D007022)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12243283/full.md

---
Source: https://tomesphere.com/paper/PMC12243283