AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment

Anastasiia Ivanova; Eva Bakaeva; Zoya Volovikova; Alexey K. Kovalev; Aleksandr I. Panov

arXiv:2506.04089·cs.LG·June 5, 2025

AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment

Anastasiia Ivanova, Eva Bakaeva, Zoya Volovikova, Alexey K. Kovalev, Aleksandr I. Panov

PDF

Open Access 1 Repo 1 Datasets 1 Video

TL;DR

AmbiK is a comprehensive, human-validated textual dataset of 1000 ambiguous and unambiguous kitchen tasks designed to facilitate the comparison of ambiguity detection methods for embodied agents.

Contribution

We introduce AmbiK, a novel, fully textual dataset of ambiguous kitchen tasks with detailed annotations, enabling standardized evaluation of ambiguity detection techniques.

Findings

01

Dataset includes 2000 tasks with ambiguity categories

02

Contains environment descriptions, questions, answers, and task plans

03

Facilitates unified benchmarking of ambiguity detection methods

Abstract

As a part of an embodied agent, Large Language Models (LLMs) are typically used for behavior planning given natural language instructions from the user. However, dealing with ambiguous instructions in real-world environments remains a challenge for LLMs. Various methods for task ambiguity detection have been proposed. However, it is difficult to compare them because they are tested on different datasets and there is no universal benchmark. For this reason, we propose AmbiK (Ambiguous Tasks in Kitchen Environment), the fully textual dataset of ambiguous instructions addressed to a robot in a kitchen environment. AmbiK was collected with the assistance of LLMs and is human-validated. It comprises 1000 pairs of ambiguous tasks and their unambiguous counterparts, categorized by ambiguity type (Human Preferences, Common Sense Knowledge, Safety), with environment descriptions, clarifying…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cog-model/ambik-dataset
noneOfficial

Datasets

IvAnastasia/AmbiK
dataset· 22 dl
22 dl

Videos

AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment· underline

Taxonomy

TopicsMultimodal Machine Learning Applications · Social Robot Interaction and HRI · Topic Modeling