# The impact of pre-processing techniques on deep learning breast image segmentation

**Authors:** Jéssica Catarino, Nuno Cruz Garcia, Sara Silva, João Santinha

PMC · DOI: 10.1038/s41598-025-30724-9 · Scientific Reports · 2025-12-16

## TL;DR

This paper examines how different pre-processing steps affect deep learning models used to segment breast images, aiming to improve accuracy in breast cancer detection.

## Contribution

The study introduces and compares two pre-processing pipelines specifically tailored for breast imaging to enhance segmentation performance.

## Key findings

- Domain-specific pre-processing improves segmentation accuracy compared to non-specific methods.
- Pixel intensity normalization significantly impacts model performance, as shown by a 3-way ANOVA F-test.
- Careful handling of breast imaging metadata preserves anatomical information crucial for accurate segmentation.

## Abstract

Breast cancer is one of the most common forms of cancer worldwide, making breast imaging a critical area for developing and evaluating Deep Learning methods. In this study, we investigate how different pre-processing techniques influence model performance in breast image segmentation. Pre-processing is a crucial step in the Deep Learning pipeline that directly impacts model performance, yet studies on its role in medical imaging remain limited. We assess the influence of different pre-processing techniques on a U-Net segmentation model applied to two breast public imaging datasets: CBIS-DDSM and Duke-Breast-Cancer-MRI. We systematically explored commonly used methods, including pixel intensity normalization, spacing harmonization, resizing/padding, and orientation standardization. Two processing pipelines were developed: Domain Non-Specific, integrating standard practices from natural and medical image analysis, and Domain Specific, which preserves anatomical information through careful handling of breast imaging metadata. A detailed comparative analysis of each pre-processing technique was conducted to evaluate its impact on model performance. Despite challenges and limitations associated with dataset size and scope, our findings identify pre-processing strategies tailored for breast imaging that can improve segmentation accuracy and analysis. This study represents an initial step in evaluating pre-processing for medical image analysis, providing a foundation for future work. Our results highlight significant differences in a 3-way ANOVA F-test (\documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$\alpha < 0.05$$\end{document}) for U-Net segmentation outcomes, attributed to different pixel intensity normalization approaches, offering valuable insights for future research.

## Linked entities

- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Diseases:** Breast cancer (MESH:D001943), cancer (MESH:D009369)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12789679/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12789679/full.md

## References

16 references — full list in the complete paper: https://tomesphere.com/paper/PMC12789679/full.md

---
Source: https://tomesphere.com/paper/PMC12789679