Optimizing LVLMs with On-Policy Data for Effective Hallucination Mitigation
Chengzhi Yu, Yifan Xu, Yifan Chen, Wenyi Zhang

TL;DR
This paper introduces a novel approach to reduce hallucinations in large vision-language models by leveraging on-policy data, a hallucination classifier, and a dynamic preference optimization algorithm, significantly improving model reliability.
Contribution
The authors propose an effective method for hallucination mitigation in LVLMs using on-policy data, a binary hallucination classifier, and an iterative preference optimization scheme, outperforming existing methods.
Findings
Halts hallucination rate by over 50% on MMHalBench
Reduces hallucination by 79.5% on Object HalBench
Enables open-source LVLMs to surpass GPT-4V performance
Abstract
Recently, large vision-language models (LVLMs) have risen to be a promising approach for multimodal tasks. However, principled hallucination mitigation remains a critical challenge.In this work, we first analyze the data generation process in LVLM hallucination mitigation and affirm that on-policy data significantly outperforms off-policy data, which thus calls for efficient and reliable preference annotation of on-policy data. We then point out that, existing annotation methods introduce additional hallucination in training samples, which may enhance the model's hallucination patterns, to address this problem, we propose training a hallucination classifier giving binary annotations, which guarantee clean chosen samples for the subsequent alignment. To further harness of the power of on-policy data, we design a robust iterative direct preference optimization (DPO) algorithm adopting a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
