Render-FM: A Foundation Model for Real-time Photorealistic Volumetric Rendering

Zhongpai Gao; Meng Zheng; Benjamin Planche; Anwesa Choudhuri; Terrence Chen; Ziyan Wu

arXiv:2505.17338·cs.CV·May 26, 2025

TL;DR

Render-FM introduces a foundation model that enables real-time, high-quality volumetric rendering of CT scans without per-scan optimization, significantly improving efficiency and clinical applicability.

Contribution

The paper presents a foundation model that uses large-scale pre-training to achieve real-time, high-fidelity volumetric rendering of CT scans without scene-specific optimization.

Findings

01. Achieves visual fidelity comparable to, or better than, specialized methods.

02. Reduces rendering preparation time from nearly an hour to seconds.

03. Enables real-time interactive 3D visualization for clinical workflows.

Abstract

Volumetric rendering of Computed Tomography (CT) scans is crucial for visualizing complex 3D anatomical structures in medical imaging. Current high-fidelity approaches, especially neural rendering techniques, require time-consuming per-scene optimization, limiting clinical applicability due to computational demands and poor generalizability. We propose Render-FM, a novel foundation model for direct, real-time volumetric rendering of CT scans. Render-FM employs an encoder-decoder architecture that directly regresses 6D Gaussian Splatting (6DGS) parameters from CT volumes, eliminating per-scan optimization through large-scale pre-training on diverse medical data. By integrating robust feature extraction with the expressive power of 6DGS, our approach efficiently generates high-quality, real-time interactive 3D visualizations across diverse clinical CT data. Experiments demonstrate that…
