MeetUp! A Corpus of Joint Activity Dialogues in a Visual Environment
Nikolai Ilinykh, Sina Zarrie{\ss}, David Schlangen

TL;DR
MeetUp! introduces a new dataset of joint activity dialogues in a visual environment, emphasizing the integration of visual perception and conversational grounding to advance AI systems' ability to understand and communicate about visual scenes.
Contribution
The paper presents a novel task and dataset that require combined visual and conversational understanding, addressing oversimplifications in previous datasets.
Findings
Collected dialogues demonstrate complex discourse phenomena.
The dataset challenges language and vision integration.
Dialogues exhibit realistic conversational behaviors.
Abstract
Building computer systems that can converse about their visual environment is one of the oldest concerns of research in Artificial Intelligence and Computational Linguistics (see, for example, Winograd's 1972 SHRDLU system). Only recently, however, have methods from computer vision and natural language processing become powerful enough to make this vision seem more attainable. Pushed especially by developments in computer vision, many data sets and collection environments have recently been published that bring together verbal interaction and visual processing. Here, we argue that these datasets tend to oversimplify the dialogue part, and we propose a task---MeetUp!---that requires both visual and conversational grounding, and that makes stronger demands on representations of the discourse. MeetUp! is a two-player coordination game where players move in a visual environment, with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Speech and dialogue systems · Human Pose and Action Recognition
