iTRIALSPACE: Programmable Virtual Lesion Trials for Controlled Evaluation of Lung CT Models
Fakrul Islam Tushar, Umme Hafsa Momy, Joseph Y. Lo, Geoffrey D. Rubin

TL;DR
iTRIALSPACE is a flexible framework for controlled, virtual lung CT lesion trials that enables detailed evaluation of model accuracy and robustness beyond traditional static benchmarks.
Contribution
It introduces a novel pipeline for synthesizing controlled lung CT datasets using real clinical data and lesion profiles, facilitating more precise model assessment.
Findings
Synthetic CTs maintain real-to-real FID baseline.
Performance rankings transfer with high correlation ($\rho$=0.93).
Controlled modes reveal insights like size prediction collapse and variance ratios.
Abstract
We introduce iTRIALSPACE, a programmable evaluation framework for controlled assessment of lung CT models. Standard benchmarks are static retrospective collections that entangle lesion size, lobe prevalence, anatomy, and acquisition context, making it difficult to determine what structurally drives model accuracy. iTRIALSPACE addresses this limitation by composing real clinical CTs and lesion profiles into controlled virtual lesion trials through a four-stage pipeline: multidataset nodule profiling, explicit trial specification, anatomy-aware mask insertion, and ControlNet-conditioned CT synthesis. The framework is built on a unified 54-attribute nodule-profile dataset spanning 13,140 annotated nodules from seven public CT sources and instantiated as 13 trial modes. We evaluate iTRIALSPACE in a 55,469-sample Virtual Lesion Study spanning three medical VLMs, four spatialguidance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
