MoHoBench: Assessing Honesty of Multimodal Large Language Models via Unanswerable Visual Questions

Yanxu Zhu; Shitong Duan; Xiangxu Zhang; Jitao Sang; Peng Zhang; Tun Lu; Xiao Zhou; Jing Yao; Xiaoyuan Yi; Xing Xie

arXiv:2507.21503·cs.AI·January 14, 2026

MoHoBench: Assessing Honesty of Multimodal Large Language Models via Unanswerable Visual Questions

Yanxu Zhu, Shitong Duan, Xiangxu Zhang, Jitao Sang, Peng Zhang, Tun Lu, Xiao Zhou, Jing Yao, Xiaoyuan Yi, Xing Xie

PDF

1 Video

TL;DR

This paper introduces MoHoBench, a large-scale benchmark for evaluating honesty in multimodal large language models when faced with visually unanswerable questions, revealing significant honesty issues and proposing initial alignment solutions.

Contribution

It is the first systematic assessment of honesty in MMLMs using a comprehensive benchmark and proposes initial methods to improve honesty behavior.

Findings

01

Most models fail to refuse unanswerable questions properly.

02

Honesty in MMLMs is influenced by visual information, not just language.

03

Proposed alignment methods improve honesty responses.

Abstract

Recently Multimodal Large Language Models (MLLMs) have achieved considerable advancements in vision-language tasks, yet produce potentially harmful or untrustworthy content. Despite substantial work investigating the trustworthiness of language models, MMLMs' capability to act honestly, especially when faced with visually unanswerable questions, remains largely underexplored. This work presents the first systematic assessment of honesty behaviors across various MLLMs. We ground honesty in models' response behaviors to unanswerable visual questions, define four representative types of such questions, and construct MoHoBench, a large-scale MMLM honest benchmark, consisting of 12k+ visual question samples, whose quality is guaranteed by multi-stage filtering and human verification. Using MoHoBench, we benchmarked the honesty of 28 popular MMLMs and conducted a comprehensive analysis. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

MoHoBench: Assessing Honesty of Multimodal Large Language Models via Unanswerable Visual Questions· underline