Reference Games as a Testbed for the Alignment of Model Uncertainty and Clarification Requests

Manar Ali; Judith Sieker; Sina Zarrie{\ss}; Hendrik Buschmeier

arXiv:2601.07820·cs.CL·May 18, 2026

Reference Games as a Testbed for the Alignment of Model Uncertainty and Clarification Requests

Manar Ali, Judith Sieker, Sina Zarrie{\ss}, Hendrik Buschmeier

PDF

TL;DR

This paper investigates whether vision-language models can recognize their own uncertainty and request clarification in reference games, highlighting current limitations and the potential of such tasks for testing model interaction capabilities.

Contribution

It introduces reference games as a testbed for assessing model uncertainty and clarification behavior, providing empirical evaluation of vision-language models in this context.

Findings

01

Models often fail to recognize their own uncertainty.

02

Models struggle to generate appropriate clarification requests.

03

Reference games are effective for testing interaction qualities.

Abstract

In human conversation, both interlocutors play an active role in maintaining mutual understanding. When listeners are uncertain about what speakers mean, for example, they can request clarification. It is an open question for language models whether they can assume a similar listener role, recognizing and expressing their own uncertainty through clarification. We argue that reference games are a suitable testbed to approach this question as they are controlled, self-contained, and make clarification needs explicit and measurable. To test this, we evaluate three vision-language models comparing a baseline reference resolution task to an experiment where the models are instructed to request clarification when uncertain. The results suggest that even in such simple tasks, models often struggle to recognize internal uncertainty and translate it into adequate clarification behavior. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeurobiology of Language and Bilingualism · Speech and dialogue systems · Categorization, perception, and language