RAcQUEt: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs

Alberto Testoni; Barbara Plank; Raquel Fern\'andez

arXiv:2412.13835·cs.CL·September 19, 2025

RAcQUEt: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs

Alberto Testoni, Barbara Plank, Raquel Fern\'andez

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces RACQUET, a dataset for studying referential ambiguity in visual question answering, revealing that current models often overconfidently misinterpret ambiguous references and produce biased responses.

Contribution

The work presents RACQUET, a novel dataset for analyzing ambiguity in multimodal models, and demonstrates the limitations and biases of state-of-the-art models in handling ambiguity.

Findings

01

Models exhibit overconfidence in ambiguous scenarios.

02

Current models often produce stereotypical, biased responses.

03

Addressing ambiguity is crucial for fair and accurate AI systems.

Abstract

Ambiguity resolution is key to effective communication. While humans effortlessly address ambiguity through conversational grounding strategies, the extent to which current language models can emulate these strategies remains unclear. In this work, we examine referential ambiguity in image-based question answering by introducing RACQUET, a carefully curated dataset targeting distinct aspects of ambiguity. Through a series of evaluations, we reveal significant limitations and problems of overconfidence of state-of-the-art large multimodal language models in addressing ambiguity in their responses. The overconfidence issue becomes particularly relevant for RACQUET-BIAS, a subset designed to analyze a critical yet underexplored problem: failing to address ambiguity leads to stereotypical, socially biased responses. Our results underscore the urgency of equipping models with robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

albertotestoni/racquet
noneOfficial

Videos

RAcQUEt: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs· underline

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Digital Imaging for Blood Diseases · Digital Media Forensic Detection