# Comparison of Deep Learning Architectures for Cardiac Contour Segmentation in Catheterization Radiographs

**Authors:** Kian A Huang, Bharath Subramanian, Haris K Choudhary, Aziz U Rehman, Neelesh S Prakash

PMC · DOI: 10.7759/cureus.103341 · Cureus · 2026-02-10

## TL;DR

This study compares U-Net and DeepLabV3 for segmenting cardiac contours in catheterization radiographs, finding U-Net to be more accurate.

## Contribution

The paper provides a direct empirical comparison of U-Net and DeepLabV3 for cardiac silhouette segmentation in catheterization radiographs.

## Key findings

- U-Net outperformed DeepLabV3 in Dice similarity coefficient, IoU, and pixel accuracy.
- Statistical analysis confirmed large effect sizes favoring U-Net across all metrics.
- U-Net's encoder-decoder architecture with skip connections improved boundary delineation.

## Abstract

Introduction

Accurate cardiac image segmentation is essential for quantitative assessment of cardiac anatomy and function. Manual segmentation, while considered the reference standard, is time-consuming and subject to inter- and intra-observer variability. Deep learning (DL) models, particularly convolutional neural networks (CNNs), have shown promise for automating this process. Among these, U-Net and DeepLabV3 are widely used architectures; however, direct comparisons between them for cardiac silhouette segmentation on cardiac catheterization radiographs are limited.

Methods

A supervised DL study was conducted using 1717 anonymized chest radiographs during cardiac catheterization with corresponding binary segmentation masks of the cardiac silhouette. Images were resized to 256 × 256 pixels, normalized, and divided into training (70%), validation (20%), and test (10%) sets. Two segmentation models were implemented: (1) a modified U-Net with batch normalization and dropout regularization, and (2) a DeepLabV3 network with a MobileNetV2 backbone pretrained on ImageNet. Both models were trained using the Adam optimizer and binary cross-entropy loss with early stopping based on validation performance. Segmentation performance was evaluated on the test set using the Dice similarity coefficient, Intersection over Union (IoU), and pixel accuracy. Statistical comparisons were performed using the Wilcoxon signed-rank test, and effect sizes were calculated using the rank-biserial correlation (RBC).

Results

U-Net consistently outperformed DeepLabV3 across all evaluation metrics. U-Net achieved a mean Dice of 0.9454, IoU of 0.8980, and pixel accuracy of 0.9844, compared to DeepLabV3 (Dice = 0.9321, IoU = 0.8742, pixel accuracy = 0.9806). These differences were statistically significant (Dice: W = 2849.0, p < 1 × 10⁻¹², RBC = 0.617; IoU: W = 2810.0, p < 1 × 10⁻¹², RBC = 0.622; pixel accuracy: W = 2768.5, p < 1 × 10⁻¹², RBC = 0.628), indicating large effect sizes favoring U-Net.

Conclusions

U-Net achieved significantly higher segmentation accuracy than DeepLabV3 for cardiac silhouette segmentation on catheterization radiographs. Its encoder-decoder architecture with skip connections enabled superior boundary delineation, reaffirming U-Net as a strong baseline for medical image segmentation. Automated cardiac silhouette segmentation may enhance the efficiency and reproducibility of cardiothoracic ratio estimation, cardiomegaly detection, and longitudinal cardiac monitoring. Future studies should focus on multi-institutional validation, chamber-level segmentation, and benchmarking against transformer-based architectures to advance clinical integration.

## Full-text entities

- **Diseases:** cardiomegaly (MESH:D006332)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12980657/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12980657/full.md

## References

16 references — full list in the complete paper: https://tomesphere.com/paper/PMC12980657/full.md

---
Source: https://tomesphere.com/paper/PMC12980657