# Automatic uterus segmentation in transvaginal ultrasound using U-Net and nnU-Net

**Authors:** Dilara Tank, Bianca G. S. Schor, Lisa M. Trommelen, Judith A. F. Huirne, Iacer Calixto, Robert A. de Leeuw, Diego Raimondo, Paolo Cazzaniga, Paolo Cazzaniga

PMC · DOI: 10.1371/journal.pone.0336237 · PLOS One · 2025-11-12

## TL;DR

This paper compares U-Net and nnU-Net for segmenting the uterus in transvaginal ultrasound images, finding that nnU-Net performs better and benefits from 3D volume screenshots.

## Contribution

The study introduces and evaluates U-Net and nnU-Net for uterus segmentation in TVUS, highlighting the benefits of nnU-Net and specific imaging types.

## Key findings

- nnU-Net outperformed U-Net across all imaging types in TVUS uterus segmentation.
- Training on specific imaging types (still images, video screenshots, 3D volume screenshots) improved segmentation performance.
- 3D volume screenshots provided the best results for nnU-Net due to reduced clutter.

## Abstract

Transvaginal ultrasound (TVUS) is pivotal for diagnosing reproductive pathologies in individuals assigned female at birth, often serving as the primary imaging method for gynecologic evaluation. Despite recent advancements in AI-driven segmentation, its application to gynecological ultrasound still needs further attention. Our study aims to bridge this gap by training and evaluating two state-of-the-art deep learning (DL) segmentation models on TVUS data.

An experienced gynecological expert manually segmented the uterus in our TVUS dataset of 124 patients with adenomyosis, comprising still images (n = 122), video screenshots (n = 472), and 3D volume screenshots (n = 452). Two popular DL segmentation models, U-Net and nnU-Net, were trained on the entire dataset, and each imaging type was trained separately. Optimization for U-Net included varying batch size, image resolution, pre-processing, and augmentation. Model performance was measured using the Dice score (DSC).

U-Net and nnU-Net had good mean segmentation performances on the TVUS uterus segmentation dataset (0.75 to 0.97 DSC). We observed that training on specific imaging types (still images, video screenshots, 3D volume screenshots) tended to yield better segmentation performance than training on the complete dataset for both models. Furthermore, nnU-Net outperformed the U-Net across all imaging types. Lastly, we report the best results using the U-Net model with limited pre-processing and augmentations.

TVUS datasets are well-suited for DL-based segmentation. nnU-Net training was faster and yielded higher segmentation performance; thus, it is recommended over manual U-Net tuning. We also recommend creating TVUS datasets that include only one imaging type and are as clutter-free as possible. The nnU-Net strongly benefited from being trained on 3D volume screenshots in our dataset, likely due to their lack of clutter. Further validation is needed to confirm the robustness of these models on TVUS datasets. Our code is available on https://github.com/dilaratank/UtiSeg.

## Linked entities

- **Diseases:** adenomyosis (MONDO:0010888)

## Full-text entities

- **Diseases:** adenomyosis (MESH:D062788)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12611126/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12611126/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/PMC12611126/full.md

---
Source: https://tomesphere.com/paper/PMC12611126