Optimizing LVLMs with On-Policy Data for Effective Hallucination Mitigation

Chengzhi Yu; Yifan Xu; Yifan Chen; Wenyi Zhang

arXiv:2512.00706·cs.CV·December 2, 2025

Optimizing LVLMs with On-Policy Data for Effective Hallucination Mitigation

Chengzhi Yu, Yifan Xu, Yifan Chen, Wenyi Zhang

PDF

Open Access

TL;DR

This paper introduces a novel approach to reduce hallucinations in large vision-language models by leveraging on-policy data, a hallucination classifier, and a dynamic preference optimization algorithm, significantly improving model reliability.

Contribution

The authors propose an effective method for hallucination mitigation in LVLMs using on-policy data, a binary hallucination classifier, and an iterative preference optimization scheme, outperforming existing methods.

Findings

01

Halts hallucination rate by over 50% on MMHalBench

02

Reduces hallucination by 79.5% on Object HalBench

03

Enables open-source LVLMs to surpass GPT-4V performance

Abstract

Recently, large vision-language models (LVLMs) have risen to be a promising approach for multimodal tasks. However, principled hallucination mitigation remains a critical challenge.In this work, we first analyze the data generation process in LVLM hallucination mitigation and affirm that on-policy data significantly outperforms off-policy data, which thus calls for efficient and reliable preference annotation of on-policy data. We then point out that, existing annotation methods introduce additional hallucination in training samples, which may enhance the model's hallucination patterns, to address this problem, we propose training a hallucination classifier giving binary annotations, which guarantee clean chosen samples for the subsequent alignment. To further harness of the power of on-policy data, we design a robust iterative direct preference optimization (DPO) algorithm adopting a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning