CheXPO-v2: Preference Optimization for Chest X-ray VLMs with Knowledge Graph Consistency

Xiao Liang; Yuxuan An; Di Wang; Jiawei Hu; Zhicheng Jiao; Bin Jing; Quan Wang

arXiv:2512.17213·cs.CV·December 22, 2025

CheXPO-v2: Preference Optimization for Chest X-ray VLMs with Knowledge Graph Consistency

Xiao Liang, Yuxuan An, Di Wang, Jiawei Hu, Zhicheng Jiao, Bin Jing, Quan Wang

PDF

Open Access 2 Models

TL;DR

CheXPO-v2 introduces a knowledge graph-based reward system for aligning medical vision-language models, significantly improving factual accuracy and reasoning transparency in chest X-ray analysis with high data efficiency.

Contribution

It presents a novel process supervision framework using entity-relation matching, enhancing model reliability and interpretability over outcome-based methods.

Findings

01

Outperforms GRPO and state-of-the-art models on MIMIC-CXR-VQA

02

Achieves new state-of-the-art accuracy with only 5k samples

03

Produces clinically sound, verifiable reasoning

Abstract

Medical Vision-Language Models (VLMs) are prone to hallucinations, compromising clinical reliability. While reinforcement learning methods like Group Relative Policy Optimization (GRPO) offer a low-cost alignment solution, their reliance on sparse, outcome-based rewards inadvertently encourages models to "overthink" -- generating verbose, convoluted, and unverifiable Chain-of-Thought reasoning to justify answers. This focus on outcomes obscures factual errors and poses significant safety risks. To address this, we propose CheXPO-v2, a novel alignment framework that shifts from outcome to process supervision. Our core innovation is a Knowledge Graph Consistency Reward mechanism driven by Entity-Relation Matching. By explicitly parsing reasoning steps into structured "Disease, Relation, Anatomy" triplets, we provide fine-grained supervision that penalizes incoherent logic and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Machine Learning in Healthcare · Advanced Graph Neural Networks