Raptor: Scalable Train-Free Embeddings for 3D Medical Volumes Leveraging Pretrained 2D Foundation Models

Ulzee An; Moonseong Jeong; Simon A. Lee; Aditya Gorla; Yuzhe Yang; Sriram Sankararaman

arXiv:2507.08254·eess.IV·July 14, 2025

Raptor: Scalable Train-Free Embeddings for 3D Medical Volumes Leveraging Pretrained 2D Foundation Models

Ulzee An, Moonseong Jeong, Simon A. Lee, Aditya Gorla, Yuzhe Yang, Sriram Sankararaman

PDF

1 Video

TL;DR

Raptor introduces a train-free, scalable method for creating meaningful embeddings of 3D medical images by leveraging pretrained 2D models and random projections, outperforming existing methods without additional training.

Contribution

It proposes Raptor, a novel train-free approach that uses pretrained 2D models and random projections to efficiently generate embeddings for volumetric medical data.

Findings

01

Outperforms state-of-the-art medical volume methods (+3% SuPreM, +6% MISFM, +10% Merlin, +13% VoCo, +14% SLIViT)

02

Reduces computational complexity significantly

03

Eliminates the need for costly training on volumetric data

Abstract

Current challenges in developing foundational models for volumetric imaging data, such as magnetic resonance imaging (MRI), stem from the computational complexity of training state-of-the-art architectures in high dimensions and curating sufficiently large datasets of volumes. To address these challenges, we introduce Raptor (Random Planar Tensor Reduction), a train-free method for generating semantically rich embeddings for volumetric data. Raptor leverages a frozen 2D foundation model, pretrained on natural images, to extract visual tokens from individual cross-sections of medical volumes. These tokens are then spatially compressed using random projections, significantly reducing computational complexity while retaining semantic information. Extensive experiments on ten diverse medical volume tasks verify the superior performance of Raptor over state-of-the-art methods, including…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Raptor: Scalable Train-Free Embeddings for 3D Medical Volumes Leveraging Pretrained 2D Foundation Models· slideslive