An Annotated Corpus of Reference Resolution for Interpreting Common Grounding
Takuma Udagawa, Akiko Aizawa

TL;DR
This paper introduces a new annotated corpus of referring expressions to better understand and analyze the process of common grounding in dialogue systems, addressing challenges of ambiguity and partial understanding.
Contribution
It provides a large, reliable, and detailed resource for studying reference resolution as a key component of common grounding in natural language dialogue.
Findings
High annotation reliability and natural disagreement levels
Enables detailed analysis of common grounding strategies
Improves interpretation and performance of dialogue systems
Abstract
Common grounding is the process of creating, repairing and updating mutual understandings, which is a fundamental aspect of natural language conversation. However, interpreting the process of common grounding is a challenging task, especially under continuous and partially-observable context where complex ambiguity, uncertainty, partial understandings and misunderstandings are introduced. Interpretation becomes even more challenging when we deal with dialogue systems which still have limited capability of natural language understanding and generation. To address this problem, we consider reference resolution as the central subtask of common grounding and propose a new resource to study its intermediate process. Based on a simple and general annotation schema, we collected a total of 40,172 referring expressions in 5,191 dialogues curated from an existing corpus, along with multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
