A Benchmark to Assess Common Ground in Human-AI Collaboration

Christian Poelitz; Finale Doshi-Velez; Si\^an Lindley

arXiv:2602.21337·cs.HC·February 26, 2026

A Benchmark to Assess Common Ground in Human-AI Collaboration

Christian Poelitz, Finale Doshi-Velez, Si\^an Lindley

PDF

Open Access

TL;DR

This paper introduces a new benchmark for evaluating common ground in human-AI collaboration, based on collaborative puzzle tasks, validated through user studies, highlighting similarities and differences with human-human collaboration.

Contribution

It presents a novel benchmark grounded in human collaboration theories to assess common ground in human-AI interaction, filling a gap in current research.

Findings

01

Benchmark reproduces human collaboration findings

02

Reveals divergences in human-AI interaction

03

Validated through human-AI collaborative puzzle solving

Abstract

AI is becoming increasingly integrated into everyday life, both in professional work environments and in leisure and entertainment contexts. This integration requires AI to move beyond acting as an assistant for informational or transactional tasks toward a genuine collaborative partner. Effective collaboration, whether between humans or between humans and AI, depends on establishing and maintaining common ground: shared beliefs, assumptions, goals, and situational awareness that enable coordinated action and efficient repair of misunderstandings. While common ground is a central concept in human collaboration, it has received limited attention in studies of human-AI collaboration. In this paper, we introduce a new benchmark grounded in theories and empirical studies of human-human collaboration. The benchmark is based on a collaborative puzzle task that requires iterative interaction,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSocial Robot Interaction and HRI · AI in Service Interactions · Ethics and Social Impacts of AI