Critique Before Thinking: Mitigating Hallucination through Rationale-Augmented Instruction Tuning

Zexian Yang; Dian Li; Dayan Wu; Gang Liu; Weiping Wang

arXiv:2505.07172·cs.CV·May 13, 2025

Critique Before Thinking: Mitigating Hallucination through Rationale-Augmented Instruction Tuning

Zexian Yang, Dian Li, Dayan Wu, Gang Liu, Weiping Wang

PDF

Open Access

TL;DR

This paper introduces Re-Critic, a rationale-augmented instruction tuning framework that reduces hallucinations in vision-language models by incorporating fundamental reasoning principles and self-critique mechanisms.

Contribution

Re-Critic is a scalable framework that integrates rationale explanations and self-criticism to improve reasoning and reduce hallucinations in multimodal models.

Findings

01

Models fine-tuned with Re-Critic outperform baselines in hallucination mitigation.

02

Re-Critic enhances reasoning abilities across diverse multimodal tasks.

03

The framework improves response grounding and factual accuracy.

Abstract

Despite significant advancements in multimodal reasoning tasks, existing Large Vision-Language Models (LVLMs) are prone to producing visually ungrounded responses when interpreting associated images. In contrast, when humans embark on learning new knowledge, they often rely on a set of fundamental pre-study principles: reviewing outlines to grasp core concepts, summarizing key points to guide their focus and enhance understanding. However, such preparatory actions are notably absent in the current instruction tuning processes. This paper presents Re-Critic, an easily scalable rationale-augmented framework designed to incorporate fundamental rules and chain-of-thought (CoT) as a bridge to enhance reasoning abilities. Specifically, Re-Critic develops a visual rationale synthesizer that scalably augments raw instructions with rationale explanation. To probe more contextually grounded…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Embodied and Extended Cognition

MethodsSparse Evolutionary Training · Focus