Time-to-Event Pretraining for 3D Medical Imaging
Zepeng Huo, Jason Alan Fries, Alejandro Lozano, Jeya Maria Jose, Valanarasu, Ethan Steinberg, Louis Blankemeier, Akshay S. Chaudhari, Curtis, Langlotz, and Nigam H. Shah

TL;DR
This paper introduces time-to-event pretraining for 3D medical imaging models, leveraging longitudinal EHR data to improve disease outcome prediction without losing diagnostic accuracy.
Contribution
It presents a novel pretraining framework that incorporates temporal supervision from EHRs, bridging the gap between imaging biomarkers and long-term health outcomes.
Findings
Achieved 23.7% average AUROC improvement
Gained 29.4% in Harrell's C-index across benchmarks
Enhanced outcome prediction without affecting diagnostic tasks
Abstract
With the rise of medical foundation models and the growing availability of imaging data, scalable pretraining techniques offer a promising way to identify imaging biomarkers predictive of future disease risk. While current self-supervised methods for 3D medical imaging models capture local structural features like organ morphology, they fail to link pixel biomarkers with long-term health outcomes due to a missing context problem. Current approaches lack the temporal context necessary to identify biomarkers correlated with disease progression, as they rely on supervision derived only from images and concurrent text descriptions. To address this, we introduce time-to-event pretraining, a pretraining framework for 3D medical imaging models that leverages large-scale temporal supervision from paired, longitudinal electronic health records (EHRs). Using a dataset of 18,945 CT scans (4.2…
Peer Reviews
Decision·ICLR 2025 Poster
1. Innovative Approach: The method creatively leverages EHR data following a medical scan to assist model pretraining, demonstrating better performance compared to imaging-only pretraining. 2. Comprehensive Evaluation: Extensive comparisons across multiple tasks validate the robustness and efficiency of the TTE-based approach across different architectures.
1. Dependence on Large EHR Datasets: This approach relies on extensive, high-quality EHR data, which many medical datasets do not include. 2. Limited Modality Scope: Tested only on CT images; broader modality testing could validate versatility across imaging types. 3. Interpretability: The TTE pretraining’s impact on specific pixel-level biomarkers is less clear; additional analysis on feature attribution could help.
- Propose utilizing the time events as pre-training tasks specially designed for prognosis tasks in downstream applications. - The manuscript is overall easy to follow
- The proposed method is limited in generalization since it will require longitudinal time-to-event EHR data as the supervision for the pre-training. In comparison to the common self-supervised pre-training, the proposed methods are harder to scale up. - There is no comparison evaluation between the proposed method and prior methods in model pre-training. Only the results of the proposed method with different model architectures are reported. It will be difficult to appreciate the benefits of t
- The presentation quality is very high. Care has been taken to logically organize the paper, clearly articulate key points, and straightforwardly present results with concise figures and tables. - The core idea is creative, making use of the wealth of longitudinal EHR data associated with each 3D volume for pretraining. - Discussion or related work and background is particularly strong. - Experiments are sufficiently thorough and easy to interpret – results are convincing.
- The actual description of the TTE pretraining approach is brief (lines 184-191) and somewhat unclear. I would advise the authors to flesh out this section. See specific questions below. - A description or list of the 8,192 EHR pretraining tasks is never provided. I’m aware there may not be a convenient place to list this many items, but a general description of categories of events or a few illustrative examples would be helpful. Without this information, it’s impossible to assess whether, e.g
Code & Models
Videos
Taxonomy
TopicsAdvanced Radiotherapy Techniques · Digital Radiography and Breast Imaging · Medical Imaging Techniques and Applications
