Nodule-Aligned Latent Space Learning with LLM-Driven Multimodal Diffusion for Lung Nodule Progression Prediction
James Song, Yifan Wang, Chuan Zhou, Liyue Shen

TL;DR
This paper introduces NAMD, a novel multimodal diffusion framework that predicts lung nodule progression by generating follow-up images conditioned on baseline scans and patient data, improving early diagnosis accuracy.
Contribution
NAMD is the first to use a nodule-aligned latent space combined with LLM-driven control for lung nodule progression prediction from multimodal data.
Findings
Achieves AUROC of 0.805 for malignancy prediction
Synthesizes follow-up images closely matching real scans
Outperforms baseline and state-of-the-art methods
Abstract
Early diagnosis of lung cancer is challenging due to biological uncertainty and the limited understanding of the biological mechanisms driving nodule progression. To address this, we propose Nodule-Aligned Multimodal (Latent) Diffusion (NAMD), a novel framework that predicts lung nodule progression by generating 1-year follow-up nodule computed tomography images with baseline scans and the patient's and nodule's Electronic Health Record (EHR). NAMD introduces a nodule-aligned latent space, where distances between latents directly correspond to changes in nodule attributes, and utilizes an LLM-driven control mechanism to condition the diffusion backbone on patient data. On the National Lung Screening Trial (NLST) dataset, our method synthesizes follow-up nodule images that achieve an AUROC of 0.805 and an AUPRC of 0.346 for lung nodule malignancy prediction, significantly outperforming…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLung Cancer Diagnosis and Treatment · AI in cancer detection · Radiomics and Machine Learning in Medical Imaging
