PET2Rep: Towards Vision-Language Model-Drived Automated Radiology Report Generation for Positron Emission Tomography

Yichi Zhang; Wenbo Zhang; Zehui Ling; Gang Feng; Sisi Peng; Deshu Chen; Yuchen Liu; Hongwei Zhang; Shuqi Wang; Lanlan Li; Limei Han; Yuan Cheng; Zixin Hu; Yuan Qi; Le Xue

arXiv:2508.04062·eess.IV·November 13, 2025

PET2Rep: Towards Vision-Language Model-Drived Automated Radiology Report Generation for Positron Emission Tomography

Yichi Zhang, Wenbo Zhang, Zehui Ling, Gang Feng, Sisi Peng, Deshu Chen, Yuchen Liu, Hongwei Zhang, Shuqi Wang, Lanlan Li, Limei Han, Yuan Cheng, Zixin Hu, Yuan Qi, Le Xue

PDF

1 Datasets

TL;DR

This paper introduces PET2Rep, a comprehensive benchmark dataset for evaluating vision-language models in automated PET radiology report generation, highlighting current models' limitations and the need for specialized medical VLMs.

Contribution

PET2Rep is the first large-scale PET-specific dataset for radiology report generation, enabling evaluation of general and medical VLMs on metabolic PET images.

Findings

01

Current VLMs perform poorly on PET report generation

02

Existing models fall short of clinical needs

03

Identified key areas for improvement in medical VLMs

Abstract

Positron emission tomography (PET) is a cornerstone of modern oncologic and neurologic imaging, distinguished by its unique ability to illuminate dynamic metabolic processes that transcend the anatomical focus of traditional imaging technologies. Radiology reports are essential for clinical decision making, yet their manual creation is labor-intensive and time-consuming. Recent advancements of vision-language models (VLMs) have shown strong potential in medical applications, presenting a promising avenue for automating report generation. However, existing applications of VLMs in the medical domain have predominantly focused on structural imaging modalities, while the unique characteristics of molecular PET imaging have largely been overlooked. To bridge the gap, we introduce PET2Rep, a large-scale comprehensive benchmark for evaluation of general and medical VLMs for radiology report…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

r1pples/PET2Rep
dataset· 10 dl
10 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.