A downsampling strategy to assess the predictive value of radiomic features
AS Dirand, F Frouin, I Buvat

TL;DR
This paper introduces a downsampling strategy to evaluate whether radiomic features contain relevant information for classification tasks, especially when data is limited, by predicting performance metrics as if using larger datasets.
Contribution
A novel downsampling method is proposed to estimate the predictive value of radiomic features for large datasets from smaller samples.
Findings
Downsampling accurately predicts performance metrics for large datasets.
Multivariate models improve with more data, univariate models do not.
The method helps identify when radiomic features lack relevant information.
Abstract
Many studies are devoted to the design of radiomic models for a prediction task. When no effective model is found, it is often difficult to know whether the radiomic features do not include information relevant to the task or because of insufficient data. We propose a downsampling method to answer that question when considering a classification task into two groups. Using two large patient cohorts, several experimental configurations involving different numbers of patients were created. Univariate or multivariate radiomic models were designed from each configuration. Their performance as reflected by the Youden index (YI) and Area Under the receiver operating characteristic Curve (AUC) was compared to the stable performance obtained with the highest number of patients. A downsampling method is described to predict the YI and AUC achievable with a large number of patients. Using the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Gastric Cancer Management and Outcomes · AI in cancer detection
