# Impact of intensity standardisation and ComBat batch size on clinical-radiomic prognostic models performance in a multi-centre study of patients with glioblastoma

**Authors:** Kavi Fatania, Russell Frood, Hitesh Mistry, Susan C. Short, James O’Connor, Andrew F. Scarsbrook, Stuart Currie

PMC · DOI: 10.1007/s00330-024-11168-7 · European Radiology · 2024-11-28

## TL;DR

This study examines how standardizing MRI intensity and using ComBat batch sizes affects the performance of radiomic models in predicting survival for glioblastoma patients across multiple centers.

## Contribution

The study evaluates the impact of intensity standardisation and ComBat batch size on radiomic model performance in a real-world, multi-centre glioblastoma cohort.

## Key findings

- HM and WS intensity standardisation improved model discrimination and fit but did not significantly affect stability or calibration.
- Larger ComBat batch sizes enhanced model performance but reduced stability and calibration accuracy.
- Heterogeneous data from multiple centers challenges the reproducibility of radiomic survival models.

## Abstract

To assess the effect of different intensity standardisation techniques (ISTs) and ComBat batch sizes on radiomics survival model performance and stability in a heterogenous, multi-centre cohort of patients with glioblastoma (GBM).

Multi-centre pre-operative MRI acquired between 2014 and 2020 in patients with IDH-wildtype unifocal WHO grade 4 GBM were retrospectively evaluated. WhiteStripe (WS), Nyul histogram matching (HM), and Z-score (ZS) ISTs were applied before radiomic feature (RF) extraction. RFs were realigned using ComBat and minimum batch size (MBS) of 5, 10, or 15 patients. Cox proportional hazards models for overall survival (OS) prediction were produced using five different selection strategies and the impact of IST and MBS was evaluated using bootstrapping. Calibration, discrimination, relative explained variation, and model fit were assessed. Instability was evaluated using 95% confidence intervals (95% CIs), feature selection frequency and calibration curves across the bootstrap resamples.

One hundred ninety-five patients were included. Median OS = 13 (95% CI: 12–14) months. Twelve to fourteen unique MRI protocols were used per MRI sequence. HM and WS produced the highest relative increase in model discrimination, explained variation and model fit but IST choice did not greatly impact on stability, nor calibration. Larger ComBat batches improved discrimination, model fit, and explained variation but higher MBS (reduced sample size) reduced stability (across all performance metrics) and reduced calibration accuracy.

Heterogenous, real-world GBM data poses a challenge to the reproducibility of radiomics. ComBat generally improved model performance as MBS increased but reduced stability and calibration. HM and WS tended to improve model performance.

Question
ComBat harmonisation of RFs and intensity standardisation of MRI have not been thoroughly evaluated in multicentre, heterogeneous GBM data.

Findings The addition of ComBat and ISTs can improve discrimination, relative model fit, and explained variance but degrades the calibration and stability of survival models.

Clinical relevance
Radiomics risk prediction models in real-world, multicentre contexts could be improved by ComBat and ISTs, however, this degrades calibration and prediction stability and this must be thoroughly investigated before patients can be accurately separated into different risk groups.

## Linked entities

- **Diseases:** glioblastoma (MONDO:0018177)

## Full-text entities

- **Genes:** IDH1 (isocitrate dehydrogenase (NADP(+)) 1) [NCBI Gene 3417] {aka HEL-216, HEL-S-26, IDCD, IDH, IDP, IDPC}
- **Diseases:** GBM (MESH:D005909)
- **Chemicals:** IST (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12081554/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12081554/full.md

## References

6 references — full list in the complete paper: https://tomesphere.com/paper/PMC12081554/full.md

---
Source: https://tomesphere.com/paper/PMC12081554