Interpreting and Mitigating Hallucination in MLLMs through Multi-agent   Debate

Zheng Lin; Zhenxing Niu; Zhibin Wang; Yinghui Xu

arXiv:2407.20505·cs.CV·July 31, 2024

Interpreting and Mitigating Hallucination in MLLMs through Multi-agent Debate

Zheng Lin, Zhenxing Niu, Zhibin Wang, Yinghui Xu

PDF

Open Access

TL;DR

This paper introduces a multi-agent debate framework with self-reflection to interpret, mitigate, and understand hallucinations in multimodal large language models, emphasizing divergent and slow thinking.

Contribution

It proposes a novel multi-agent debate approach combined with self-reflection to reduce hallucinations and interpret their causes in MLLMs, advancing beyond prior detection methods.

Findings

01

Effective hallucination mitigation across multiple benchmarks

02

Enhanced interpretability of hallucination causes

03

Improved evaluation of MLLMs' creativity capabilities

Abstract

MLLMs often generate outputs that are inconsistent with the visual content, a challenge known as hallucination. Previous methods focus on determining whether a generated output is hallucinated, without identifying which image region leads to the hallucination or interpreting why such hallucinations occur. In this paper, we argue that hallucination in MLLMs is partially due to a lack of slow-thinking and divergent-thinking in these models. To address this, we propose adopting a self-reflection scheme to promote slow-thinking. Furthermore, we consider eliminating hallucination as a complex reasoning task and propose a multi-agent debate approach to encourage divergent-thinking. Consequently, our approach can not only mitigate hallucinations but also interpret why they occur and detail the specifics of hallucination. In addition, we propose to distinguish creativity from hallucination in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopological and Geometric Data Analysis · Mental Health Research Topics · Psychedelics and Drug Studies

MethodsFocus