Verb Mirage: Unveiling and Assessing Verb Concept Hallucinations in Multimodal Large Language Models
Zehao Wang, Xinpeng Liu, Yudonglin Zhang, Xiaoqian Wu, Zhou Fang, Yifan Fang, Junfu Pu, Cewu Lu, Yong-Lu Li

TL;DR
This paper investigates the overlooked issue of verb hallucinations in multimodal large language models, revealing their severity, evaluating existing mitigation methods, and proposing a new knowledge-based tuning approach that effectively reduces such hallucinations.
Contribution
It is the first study to analyze verb hallucinations in MLLMs and introduces a novel knowledge-based tuning method to mitigate this specific issue.
Findings
Most state-of-the-art MLLMs suffer from severe verb hallucination.
Existing mitigation methods are ineffective against verb hallucination.
The proposed tuning method significantly reduces verb hallucinations.
Abstract
Multimodal Large Language Models (MLLMs) have garnered significant attention recently and demonstrate outstanding capabilities in various tasks such as OCR, VQA, captioning, . However, hallucination remains a persistent issue. While numerous methods have been proposed to mitigate hallucinations, achieving notable improvements, these methods primarily focus on mitigating hallucinations about concepts. Verb concepts, crucial for understanding human actions, have been largely overlooked. In this paper, to the best of our knowledge, we are the to investigate the phenomenon of MLLMs from various perspectives. Our findings reveal that most state-of-the-art MLLMs suffer from severe verb hallucination. To assess the effectiveness of existing mitigation methods for object concept hallucination on verb…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMental Health via Writing · Machine Learning in Healthcare · Misinformation and Its Impacts
MethodsSoftmax · Attention Is All You Need · Focus
