CheXPO: Preference Optimization for Chest X-ray VLMs with Counterfactual Rationale
Xiao Liang, Jiawei Hu, Di Wang, Zhi Ma, Lin Zhao, Ronghan Li, Bo Wan, Quan Wang

TL;DR
CheXPO introduces a preference optimization method for chest X-ray vision-language models that leverages counterfactual rationales and similarity-based hard example mining to improve reliability and performance with minimal expert annotation.
Contribution
It proposes a novel CheXPO strategy combining confidence analysis, similarity retrieval, and counterfactual rationales for effective preference optimization in medical VLMs.
Findings
Achieves 8.93% relative performance gain with only 5% of training samples.
Reaches state-of-the-art results across multiple clinical tasks.
Provides a scalable, interpretable approach for radiology applications.
Abstract
Vision-language models (VLMs) are prone to hallucinations that critically compromise reliability in medical applications. While preference optimization can mitigate these hallucinations through clinical feedback, its implementation faces challenges such as clinically irrelevant training samples, imbalanced data distributions, and prohibitive expert annotation costs. To address these challenges, we introduce CheXPO, a Chest X-ray Preference Optimization strategy that combines confidence-similarity joint mining with counterfactual rationale. Our approach begins by synthesizing a unified, fine-grained multi-task chest X-ray visual instruction dataset across different question types for supervised fine-tuning (SFT). We then identify hard examples through token-level confidence analysis of SFT failures and use similarity-based retrieval to expand hard examples for balancing preference sample…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · COVID-19 diagnosis using AI · Advanced Neural Network Applications
