Pre-Training Multimodal Hallucination Detectors with Corrupted Grounding   Data

Spencer Whitehead; Jacob Phillips; Sean Hendryx

arXiv:2409.00238·cs.CL·September 4, 2024

Pre-Training Multimodal Hallucination Detectors with Corrupted Grounding Data

Spencer Whitehead, Jacob Phillips, Sean Hendryx

PDF

Open Access

TL;DR

This paper introduces a sequence labeling approach for detecting and localizing hallucinated text in multimodal language models, utilizing corrupted grounding data for pre-training to improve sample efficiency and detection accuracy.

Contribution

It presents a novel sequence labeling framework for hallucination detection and a data augmentation method using corrupted grounding data for pre-training.

Findings

01

Pre-training on corrupted grounding data enhances detection performance.

02

The proposed method improves sample efficiency during fine-tuning.

03

Grounding data-based learning signals are crucial for effective hallucination localization.

Abstract

Multimodal language models can exhibit hallucinations in their outputs, which limits their reliability. The ability to automatically detect these errors is important for mitigating them, but has been less explored and existing efforts do not localize hallucinations, instead framing this as a classification task. In this work, we first pose multimodal hallucination detection as a sequence labeling task where models must localize hallucinated text spans and present a strong baseline model. Given the high cost of human annotations for this task, we propose an approach to improve the sample efficiency of these models by creating corrupted grounding data, which we use for pre-training. Leveraging phrase grounding data, we generate hallucinations to replace grounded spans and create hallucinated text. Experiments show that pre-training on this data improves sample efficiency when fine-tuning,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health Research Topics · Functional Brain Connectivity Studies · Hallucinations in medical conditions