Variational Autoencoders for Feature Exploration and Malignancy Prediction of Lung Lesions
Benjamin Keel, Aaron Quyn, David Jayne, Samuel D. Relton

TL;DR
This paper explores the use of Variational Autoencoders to analyze lung lesions from CT scans, achieving high diagnostic accuracy and providing interpretable feature representations that could enhance clinical decision-making.
Contribution
It introduces a novel application of VAEs for lung lesion analysis, compares Gaussian and Dirichlet VAEs for explainability, and demonstrates clinically meaningful latent space traversals.
Findings
VAE-based models achieved AUC 0.98 and 93.1% accuracy in lung cancer diagnosis.
Latent space clustering separates benign and malignant lesions based on meaningful features.
Dirichlet VAE offers more explainable and disentangled feature representations.
Abstract
Lung cancer is responsible for 21% of cancer deaths in the UK and five-year survival rates are heavily influenced by the stage the cancer was identified at. Recent studies have demonstrated the capability of AI methods for accurate and early diagnosis of lung cancer from routine scans. However, this evidence has not translated into clinical practice with one barrier being a lack of interpretable models. This study investigates the application Variational Autoencoders (VAEs), a type of generative AI model, to lung cancer lesions. Proposed models were trained on lesions extracted from 3D CT scans in the LIDC-IDRI public dataset. Latent vector representations of 2D slices produced by the VAEs were explored through clustering to justify their quality and used in an MLP classifier model for lung cancer diagnosis, the best model achieved state-of-the-art metrics of AUC 0.98 and 93.1%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · AI in cancer detection · Cancer-related molecular mechanisms research
