A Highly Clean Recipe Dataset with Ingredient States Annotation for State Probing Task
Mashiro Toyooka, Kiyoharu Aizawa, Yoko Yamakata

TL;DR
This paper introduces a new Japanese recipe dataset with ingredient state annotations to evaluate and improve large language models' ability to understand and track ingredient states during cooking procedures.
Contribution
It presents a novel dataset and three tasks for assessing LLMs' understanding of ingredient state transitions in recipes, enhancing their procedural comprehension.
Findings
LLMs can learn ingredient state knowledge to improve cooking process understanding
Performance of Llama3.1-70B and Qwen2.5-72B approaches that of commercial LLMs
The dataset is publicly available for further research
Abstract
Large Language Models (LLMs) are trained on a vast amount of procedural texts, but they do not directly observe real-world phenomena. In the context of cooking recipes, this poses a challenge, as intermediate states of ingredients are often omitted, making it difficult for models to track ingredient states and understand recipes accurately. In this paper, we apply state probing, a method for evaluating a language model's understanding of the world, to the domain of cooking. We propose a new task and dataset for evaluating how well LLMs can recognize intermediate ingredient states during cooking procedures. We first construct a new Japanese recipe dataset with clear and accurate annotations of ingredient state changes, collected from well-structured and controlled recipe texts. Using this dataset, we design three novel tasks to evaluate whether LLMs can track ingredient state transitions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Explainable Artificial Intelligence (XAI) · Software Engineering Research
