Deep Learning Based Named Entity Recognition Models for Recipes
Mansi Goel, Ayush Agarwal, Shubham Agrawal, Janak Kapuriya, Akhil, Vamshi Konam, Rishabh Gupta, Shrey Rastogi, Niharika, and Ganesh Bagler

TL;DR
This paper develops and evaluates deep learning models for recognizing named entities in recipes, demonstrating that fine-tuned spaCy-transformer models outperform large language models with few-shot prompting across multiple datasets.
Contribution
It introduces a large, diverse dataset for recipe NER and systematically compares deep learning approaches, highlighting the effectiveness of fine-tuned models over LLM prompting.
Findings
Fine-tuned spaCy-transformer achieved macro-F1 scores above 95%.
Few-shot prompting on LLMs performed poorly for recipe NER.
Created a large, annotated dataset from recipe texts for improved NER training.
Abstract
Food touches our lives through various endeavors, including flavor, nourishment, health, and sustainability. Recipes are cultural capsules transmitted across generations via unstructured text. Automated protocols for recognizing named entities, the building blocks of recipe text, are of immense value for various applications ranging from information extraction to novel recipe generation. Named entity recognition is a technique for extracting information from unstructured or semi-structured data with known labels. Starting with manually-annotated data of 6,611 ingredient phrases, we created an augmented dataset of 26,445 phrases cumulatively. Simultaneously, we systematically cleaned and analyzed ingredient phrases from RecipeDB, the gold-standard recipe data repository, and annotated them using the Stanford NER. Based on the analysis, we sampled a subset of 88,526 phrases using a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Web Data Mining and Analysis
