Enhanced Lung Cancer Survival Prediction using Semi-Supervised Pseudo-Labeling and Learning from Diverse PET/CT Datasets
Mohammad R. Salmanpour, Arman Gorji, Amin Mousavi, Ali Fathi Jouzdani,, Nima Sanati, Mehdi Maghsudi, Bonnie Leung, Cheryl Ho, Ren Yuan, Arman Rahmim

TL;DR
This study demonstrates that semi-supervised pseudo-labeling using diverse PET/CT datasets significantly improves lung cancer survival prediction accuracy, especially when data is limited, by leveraging deep radiomic features and hybrid machine learning systems.
Contribution
It introduces a novel semi-supervised learning approach with pseudo-labeling from diverse datasets to enhance lung cancer survival prediction using radiomic features.
Findings
SSL outperforms supervised learning with p<0.05
Achieved 0.85 accuracy with DRFs and PCA+MLP
PCA with Gradient Boosting yields c-index of 0.80
Abstract
Objective: This study explores a semi-supervised learning (SSL), pseudo-labeled strategy using diverse datasets to enhance lung cancer (LCa) survival predictions, analyzing Handcrafted and Deep Radiomic Features (HRF/DRF) from PET/CT scans with Hybrid Machine Learning Systems (HMLS). Methods: We collected 199 LCa patients with both PET & CT images, obtained from The Cancer Imaging Archive (TCIA) and our local database, alongside 408 head&neck cancer (HNCa) PET/CT images from TCIA. We extracted 215 HRFs and 1024 DRFs by PySERA and a 3D-Autoencoder, respectively, within the ViSERA software, from segmented primary tumors. The supervised strategy (SL) employed a HMLSs: PCA connected with 4 classifiers on both HRF and DRFs. SSL strategy expanded the datasets by adding 408 pseudo-labeled HNCa cases (labeled by Random Forest algorithm) to 199 LCa cases, using the same HMLSs techniques.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Lung Cancer Diagnosis and Treatment · Lung Cancer Treatments and Mutations
MethodsPrincipal Components Analysis
