On the Utility of Foundation Models for Fast MRI: Vision-Language-Guided Image Reconstruction

Ruimin Feng; Xingxin He; Ronald Mercer; Zachary Stewart; and Fang Liu

arXiv:2511.19641·cs.CV·November 26, 2025

On the Utility of Foundation Models for Fast MRI: Vision-Language-Guided Image Reconstruction

Ruimin Feng, Xingxin He, Ronald Mercer, Zachary Stewart, and Fang Liu

PDF

Open Access

TL;DR

This paper explores using vision-language foundation models to improve undersampled MRI reconstruction by incorporating high-level semantic information, leading to better anatomical detail preservation and perceptual quality.

Contribution

It introduces a semantic distribution-guided reconstruction framework that leverages pre-trained vision-language models and contrastive learning to enhance MRI image quality with semantic priors.

Findings

01

Semantic priors improve anatomical detail preservation.

02

Image-language priors enable high-level control over reconstructions.

03

Contrastive objective aligns features with semantic distributions effectively.

Abstract

Purpose: To investigate whether a vision-language foundation model can enhance undersampled MRI reconstruction by providing high-level contextual information beyond conventional priors. Methods: We proposed a semantic distribution-guided reconstruction framework that uses a pre-trained vision-language foundation model to encode both the reconstructed image and auxiliary information into high-level semantic features. A contrastive objective aligns the reconstructed representation with the target semantic distribution, ensuring consistency with high-level perceptual cues. The proposed objective works with various deep learning-based reconstruction methods and can flexibly incorporate semantic priors from multimodal sources. To test the effectiveness of these semantic priors, we evaluated reconstruction results guided by priors derived from either image-only or image-language auxiliary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced MRI Techniques and Applications · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning