# Robustness of Machine Learning Predictions for Determining Whether Deep Inspiration Breath-Hold Is Required in Breast Cancer Radiation Therapy

**Authors:** Wlla E. Al-Hammad, Masahiro Kuroda, Ghaida Al Jamal, Mamiko Fujikura, Ryo Kamizaki, Kazuhiro Kuroda, Suzuka Yoshida, Yoshihide Nakamura, Masataka Oita, Yoshinori Tanabe, Kohei Sugimoto, Irfan Sugianto, Majd Barham, Nouha Tekiki, Miki Hisatomi, Junichi Asaumi

PMC · DOI: 10.3390/diagnostics15060668 · Diagnostics · 2025-03-10

## TL;DR

This study evaluates how well machine learning models can predict if a deep breath-hold technique is needed during breast cancer radiation therapy to protect the heart.

## Contribution

The study introduces a rigorous evaluation of ML model robustness across different heart dose thresholds and parameter settings in radiation therapy planning.

## Key findings

- The decision tree model showed high robustness at lower heart dose thresholds (240 and 270 cGy).
- The random forest model performed best at a higher threshold (300 cGy).
- The decision tree model was stable and reliable at the critical 240 cGy threshold.

## Abstract

Background/Objectives: Deep inspiration breath-hold (DIBH) is a commonly used technique to reduce the mean heart dose (MHD), which is critical for minimizing late cardiac side effects in breast cancer patients undergoing radiation therapy (RT). Although previous studies have explored the potential of machine learning (ML) to predict which patients might benefit from DIBH, none have rigorously assessed ML model performance across various MHD thresholds and parameter settings. This study aims to evaluate the robustness of ML models in predicting the need for DIBH across different clinical scenarios. Methods: Using data from 207 breast cancer patients treated with RT, we developed and tested ML models at three MHD cut-off values (240, 270, and 300 cGy), considering variations in the number of independent variables (three vs. six) and folds in the cross-validation (three, four, and five). Robustness was defined as achieving high F2 scores and low instability in predictive performance. Results: Our findings indicate that the decision tree (DT) model demonstrated consistently high robustness at 240 and 270 cGy, while the random forest model performed optimally at 300 cGy. At 240 cGy, a threshold critical to minimize late cardiac risks, the DT model exhibited stable predictive power, reducing the risk of overestimating DIBH necessity. Conclusions: These results suggest that the DT model, particularly at lower MHD thresholds, may be the most reliable for clinical applications. By providing a tool for targeted DIBH implementation, this model has the potential to enhance patient-specific treatment planning and improve clinical outcomes in RT.

## Linked entities

- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Diseases:** Breast Cancer (MESH:D001943)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11941375/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11941375/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/PMC11941375/full.md

---
Source: https://tomesphere.com/paper/PMC11941375