Ingredient Extraction from Text in the Recipe Domain

Arkin Dharawat; Chris Doan

arXiv:2204.08137·cs.CL·April 19, 2022·1 cites

Ingredient Extraction from Text in the Recipe Domain

Arkin Dharawat, Chris Doan

PDF

Open Access 1 Repo

TL;DR

This paper presents a BERT-based model for extracting ingredients from plain-text recipe queries, achieving high accuracy and aiding virtual assistants in understanding user requests.

Contribution

It introduces a fine-tuned BERT model specifically designed for ingredient extraction in the recipe domain, with publicly available code.

Findings

01

Achieved an F1-score of 95.01% on ingredient extraction.

02

Demonstrated the effectiveness of BERT in domain-specific information extraction.

03

Provided open-source code for reproducibility.

Abstract

In recent years, there has been an increase in the number of devices with virtual assistants (e.g: Siri, Google Home, Alexa) in our living rooms and kitchens. As a result of this, these devices receive several queries about recipes. All these queries will contain terms relating to a "recipe-domain" i.e: they will contain dish-names, ingredients, cooking times, dietary preferences etc. Extracting these recipe-relevant aspects from the query thus becomes important when it comes to addressing the user's information need. Our project focuses on extracting ingredients from such plain-text user utterances. Our best performing model was a fine-tuned BERT which achieved an F1-score of $95.01$ . We have released all our code in a GitHub repository.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ArkinDharawat/ingredient_extraction
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWeb Data Mining and Analysis · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Adam · Residual Connection · Dense Connections · Attention Dropout · Layer Normalization · Weight Decay