A Benchmark to Assess Common Ground in Human-AI Collaboration
Christian Poelitz, Finale Doshi-Velez, Si\^an Lindley

TL;DR
This paper introduces a new benchmark for evaluating common ground in human-AI collaboration, based on collaborative puzzle tasks, validated through user studies, highlighting similarities and differences with human-human collaboration.
Contribution
It presents a novel benchmark grounded in human collaboration theories to assess common ground in human-AI interaction, filling a gap in current research.
Findings
Benchmark reproduces human collaboration findings
Reveals divergences in human-AI interaction
Validated through human-AI collaborative puzzle solving
Abstract
AI is becoming increasingly integrated into everyday life, both in professional work environments and in leisure and entertainment contexts. This integration requires AI to move beyond acting as an assistant for informational or transactional tasks toward a genuine collaborative partner. Effective collaboration, whether between humans or between humans and AI, depends on establishing and maintaining common ground: shared beliefs, assumptions, goals, and situational awareness that enable coordinated action and efficient repair of misunderstandings. While common ground is a central concept in human collaboration, it has received limited attention in studies of human-AI collaboration. In this paper, we introduce a new benchmark grounded in theories and empirical studies of human-human collaboration. The benchmark is based on a collaborative puzzle task that requires iterative interaction,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · AI in Service Interactions · Ethics and Social Impacts of AI
