Reflective Instruction Tuning: Mitigating Hallucinations in Large   Vision-Language Models

Jinrui Zhang; Teng Wang; Haigang Zhang; Ping Lu; Feng Zheng

arXiv:2407.11422·cs.CV·July 17, 2024·1 cites

Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models

Jinrui Zhang, Teng Wang, Haigang Zhang, Ping Lu, Feng Zheng

PDF

Open Access

TL;DR

This paper introduces reflective instruction tuning with rationale learning to reduce hallucinations in large vision-language models, using a new dataset REVERIE to improve reasoning and alignment.

Contribution

It proposes a novel training method that incorporates rationale prediction, enhancing reasoning and reducing hallucinations in LVLMs, supported by a large annotated dataset REVERIE.

Findings

01

Improved performance on multiple LVLM benchmarks.

02

Enhanced reasoning capabilities in models.

03

Reduction in hallucination instances.

Abstract

Large vision-language models (LVLMs) have shown promising performance on a variety of vision-language tasks. However, they remain susceptible to hallucinations, generating outputs misaligned with visual content or instructions. While various mitigation strategies have been proposed, they often neglect a key contributor to hallucinations: lack of fine-grained reasoning supervision during training. Without intermediate reasoning steps, models may establish superficial shortcuts between instructions and responses, failing to internalize the inherent reasoning logic. To address this challenge, we propose reflective instruction tuning, which integrates rationale learning into visual instruction tuning. Unlike previous methods that learning from responses only, our approach entails the model predicting rationales justifying why responses are correct or incorrect. This fosters a deeper…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEpilepsy research and treatment