# Summary Report of the SNMMI AI Task Force Radiomics Challenge 2024

**Authors:** Ronald Boellaard, Arman Rahmim, Jacoba J. Eertink, Ulrich Duehrsen, Lars Kurch, Pieternella J. Lugtenburg, Sanne E. Wiegers, Gerben J.C. Zwezerijnen, Josée M. Zijlstra, Martijn W. Heymans, Irène Buvat

PMC · DOI: 10.2967/jnumed.124.269425 · Journal of Nuclear Medicine · 2025-08-01

## TL;DR

This paper summarizes a challenge comparing machine-learning models for predicting survival in diffuse large B-cell lymphoma patients using PET/CT radiomics data.

## Contribution

The study evaluates the effectiveness of advanced radiomic features and machine learning over simple models for survival prediction.

## Key findings

- Nineteen models for predicting continuous PFS were submitted, with six performing similarly to a simple reference model.
- Only one binary outcome model showed marginally better performance than a basic logistic regression model.
- Sophisticated radiomic features and machine learning added limited value compared to simple models in this dataset.

## Abstract

In medical imaging, challenges are competitions that aim to provide a fair comparison of different methodologic solutions to a common problem. Challenges typically focus on addressing real-world problems, such as segmentation, detection, and prediction tasks, using various types of medical images and associated data. Here, we describe the organization and results of such a challenge to compare machine-learning models for predicting survival in patients with diffuse large B-cell lymphoma using a baseline 18F-FDG PET/CT radiomics dataset. Methods: This challenge aimed to predict progression-free survival (PFS) in patients with diffuse large B-cell lymphoma, either as a binary outcome (shorter than 2 y versus longer than 2 y) or as a continuous outcome (survival in months). All participants were provided with a radiomic training dataset, including the ground truth survival for designing a predictive model and a radiomic test dataset without ground truth. Figures of merit (FOMs) used to assess model performance were the root-mean-square error for continuous outcomes and the C-index for 1-, 2-, and 3-y PFS binary outcomes. The challenge was endorsed and initiated by the Society of Nuclear Medicine and Molecular Imaging AI Task Force. Results: Nineteen models for predicting PFS as a continuous outcome from 15 teams were received. Among those models, external validation identified 6 models showing similar performance to that of a simple general linear reference model using SUV and total metabolic tumor volumes (TMTV) only. Twelve models for predicting binary outcomes were submitted by 9 teams. External validation showed that 1 model had higher, but nonsignificant, C-index values compared with values obtained by a simple logistic regression model using SUV and TMTV. Conclusion: Some of the radiomic-based machine-learning models developed by participants showed better FOMs than did simple linear or logistic regression models based on SUV and TMTV only, although the differences in observed FOMs were nonsignificant. This suggests that, for the challenge dataset, there was limited or no value seen from the addition of sophisticated radiomic features and use of machine learning when developing models for outcome prediction.

## Linked entities

- **Diseases:** diffuse large B-cell lymphoma (MONDO:0018905)

## Full-text entities

- **Diseases:** tumor (MESH:D009369), diffuse large B-cell lymphoma (MESH:D016403)
- **Chemicals:** F-FDG (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12320580/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12320580/full.md

## References

24 references — full list in the complete paper: https://tomesphere.com/paper/PMC12320580/full.md

---
Source: https://tomesphere.com/paper/PMC12320580