Pre-Training Multimodal Hallucination Detectors with Corrupted Grounding Data
Spencer Whitehead, Jacob Phillips, Sean Hendryx

TL;DR
This paper introduces a sequence labeling approach for detecting and localizing hallucinated text in multimodal language models, utilizing corrupted grounding data for pre-training to improve sample efficiency and detection accuracy.
Contribution
It presents a novel sequence labeling framework for hallucination detection and a data augmentation method using corrupted grounding data for pre-training.
Findings
Pre-training on corrupted grounding data enhances detection performance.
The proposed method improves sample efficiency during fine-tuning.
Grounding data-based learning signals are crucial for effective hallucination localization.
Abstract
Multimodal language models can exhibit hallucinations in their outputs, which limits their reliability. The ability to automatically detect these errors is important for mitigating them, but has been less explored and existing efforts do not localize hallucinations, instead framing this as a classification task. In this work, we first pose multimodal hallucination detection as a sequence labeling task where models must localize hallucinated text spans and present a strong baseline model. Given the high cost of human annotations for this task, we propose an approach to improve the sample efficiency of these models by creating corrupted grounding data, which we use for pre-training. Leveraging phrase grounding data, we generate hallucinations to replace grounded spans and create hallucinated text. Experiments show that pre-training on this data improves sample efficiency when fine-tuning,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health Research Topics · Functional Brain Connectivity Studies · Hallucinations in medical conditions
