TL;DR
This paper explores hallucination in Vision-Language Models from a cognitive psychology perspective, introducing a taxonomy of biases, a new benchmark, and analyzing how model size affects these biases, with validation from human studies.
Contribution
It introduces a psychological taxonomy of VLM hallucinations, a scalable benchmark AIpsych, and provides insights into how model architecture influences cognitive biases in responses.
Findings
Larger models show increased sycophancy.
Model size reduction of authority bias.
Human study validates psychological bias hypotheses.
Abstract
Hallucination is a long-standing problem that has been actively investigated in Vision-Language Models (VLMs). Existing research commonly attributes hallucinations to technical limitations or sycophancy bias, where the latter means the models tend to generate incorrect answers to align with user expectations. However, these explanations primarily focus on technical or externally driven factors, and may have neglected the possibility that hallucination behaviours might mirror cognitive biases observed in human psychology. In this work, we introduce a psychological taxonomy, categorizing VLMs' cognitive biases that lead to hallucinations, including sycophancy, logical inconsistency, and a newly identified VLMs behaviour: appeal to authority. To systematically analyze these behaviours, we design AIpsych, a scalable benchmark that reveals psychological tendencies in model response patterns.…
Peer Reviews
Decision·Submitted to ICLR 2026
- The paper examines the problems discussed from an interesting angle. I have to admit this is the first paper I've read that tries to fit human psychology concepts and anthropomorphize them this much. It remains to be seen whether this is the right approach but it is interesting at least. - The benchmark introduced seems to be of reasonable size and I think it will provide insights on model behaviors.
- The writing is a bit poor (eg. at end of page 4: "because we want to see if the model naturally inherent some psychological disorder without intervention") and the presentation is generally confusing. - The paper's central claim rests on a flimsy and unclear distinction between "sycophancy" and "authority bias." The authors essentially rename a known artifact of autoregressive models - bias towards information present in context - to "authority bias." From the context of a VLM responding to a
- This paper is addressing one of important challenges towards the multi-modal AI systems. They re-organized the current hallucination problems in the VLMs' responses into cognitive psychology, which is novel angle for conceptual shift. They defined and curated psychologincal taxanomy for the VLM's bias and hierarchically designed for extend to AIpsych bnehcmark that systematically organizes diverse visual scenarios for the hallucination. - The paper evluates multiple VLM of diffrent scales cov
- The idea of linking cognitive bias with hallucination is interesting, but the data design does not really support this claim. Most tasks in the benchmark are simple binary or attribute-level questions that test visual recognition rather than cognitive reasoning. The results seem to reflect perceptual errors, not true psychological bias. - As one of the critical issues is that the paper mainly describes and categorizes hallucination types without suggesting how to reduce or handle them. There
The paper raises an interesting interdisciplinary perspective, attempting to connect cognitive psychology with VLM hallucination analysis. The writing is clear and the paper is well-structured, making it easy to follow the motivation and experimental design.
1; The mapping between model responses and human cognitive biases is analogical rather than theoretically defined. “Authority bias” closely overlaps with standard instruction following or over-alignment. 2: The study reports frequency trends without statistical testing or causal modeling. Observed scaling effects replicate well-known alignment patterns rather than revealing new mechanisms. 3. The benchmark mainly re-labels existing hallucination behaviors under psychological terms, no mitigation
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
