Information-Theoretic Text Hallucination Reduction for Video-grounded Dialogue
Sunjae Yoon, Eunseop Yoon, Hee Suk Yoon, Junyeong Kim, Chang D. Yoo

TL;DR
This paper introduces an information-theoretic framework to reduce text hallucination in video-grounded dialogue systems, improving answer accuracy and interpretability by discouraging indiscriminate copying from input texts.
Contribution
It proposes the Text Hallucination Mitigating (THAM) framework with a novel regularization loss based on information theory to address text hallucination in VGD systems.
Findings
Improved performance on AVSD@DSTC7 and AVSD@DSTC8 benchmarks.
Enhanced interpretability of dialogue systems.
Effective reduction of text hallucination in VGD tasks.
Abstract
Video-grounded Dialogue (VGD) aims to decode an answer sentence to a question regarding a given video and dialogue context. Despite the recent success of multi-modal reasoning in generating answer sentences, existing dialogue systems still suffer from a text hallucination problem: indiscriminate copying of text from the input without an understanding of the question. The problem arises from spurious correlations learned because answer sentences in the dataset usually include words from the input texts; the VGD system therefore relies excessively on copying input words in the hope that they overlap with the ground-truth text. Hence, we design the Text Hallucination Mitigating (THAM) framework, which incorporates a Text Hallucination Regularization (THR) loss derived from the proposed information-theoretic text hallucination measurement approach. Applying THAM with current…
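To make the copying behaviour described in the abstract concrete, the sketch below computes a crude lexical copy ratio: the fraction of answer tokens that also appear in the input texts. This is only an illustrative proxy and is not the paper's information-theoretic measurement or the THR loss; the function names `tokenize` and `copy_ratio`, the regex tokenizer, and the example sentences are all assumptions made for this example.

```python
import re


def tokenize(text):
    """Lowercase and split into word tokens (a simplifying assumption)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def copy_ratio(answer, input_texts):
    """Fraction of answer tokens that also occur in any of the input texts.

    Hypothetical helper: a rough lexical proxy for text copying,
    not the measurement proposed in the paper.
    """
    answer_tokens = tokenize(answer)
    if not answer_tokens:
        return 0.0
    # Collect every token seen in the question / dialogue history.
    source_vocab = set()
    for text in input_texts:
        source_vocab.update(tokenize(text))
    copied = sum(1 for tok in answer_tokens if tok in source_vocab)
    return copied / len(answer_tokens)


# Made-up example inputs (question plus one line of dialogue context).
inputs = ["is the man holding a cup", "a man walks into the kitchen"]
print(copy_ratio("the man is holding a cup", inputs))  # every token is copied
print(copy_ratio("he picks up a book", inputs))        # only "a" is copied
```

A high ratio alone does not indicate hallucination, since correct answers often reuse input words; the point of the abstract is that a model can learn to exploit this overlap without understanding the question.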
Taxonomy
Topics: Multimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
