Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study
Anneketh Vij, Changhao Liu, Rahul Anil Nair, Theodore Eugene Ho,, Edward Shi, Ayan Bhowmick

TL;DR
This paper compares various small language models for recipe generation, developing new evaluation metrics and allergen substitution methods, revealing nuanced relationships between model size, domain-specific quality, and performance.
Contribution
It introduces a comprehensive evaluation framework with domain-specific metrics and allergen substitution techniques for recipe generation using small language models.
Findings
Larger models generally perform better on standard metrics.
SmolLM-360M and SmolLM-1.7B show similar performance after fine-tuning.
Phi-2 has limitations in recipe generation despite larger size.
Abstract
This research presents an exploration and study of the recipe generation task by fine-tuning various very small language models, with a focus on developing robust evaluation metrics and comparing across different language models the open-ended task of recipe generation. This study presents extensive experiments with multiple model architectures, ranging from T5-small (Raffel et al., 2023) and SmolLM-135M(Allal et al., 2024) to Phi-2 (Research, 2023), implementing both traditional NLP metrics and custom domain-specific evaluation metrics. Our novel evaluation framework incorporates recipe-specific metrics for assessing content quality and introduces approaches to allergen substitution. The results indicate that, while larger models generally perform better on standard metrics, the relationship between model size and recipe quality is more nuanced when considering domain-specific metrics.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsFocus
