Fine-tuning Language Models for Recipe Generation: A Comparative   Analysis and Benchmark Study

Anneketh Vij; Changhao Liu; Rahul Anil Nair; Theodore Eugene Ho,; Edward Shi; Ayan Bhowmick

arXiv:2502.02028·cs.CL·February 18, 2025

Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study

Anneketh Vij, Changhao Liu, Rahul Anil Nair, Theodore Eugene Ho,, Edward Shi, Ayan Bhowmick

PDF

Open Access

TL;DR

This paper compares various small language models for recipe generation, developing new evaluation metrics and allergen substitution methods, revealing nuanced relationships between model size, domain-specific quality, and performance.

Contribution

It introduces a comprehensive evaluation framework with domain-specific metrics and allergen substitution techniques for recipe generation using small language models.

Findings

01

Larger models generally perform better on standard metrics.

02

SmolLM-360M and SmolLM-1.7B show similar performance after fine-tuning.

03

Phi-2 has limitations in recipe generation despite larger size.

Abstract

This research presents an exploration and study of the recipe generation task by fine-tuning various very small language models, with a focus on developing robust evaluation metrics and comparing across different language models the open-ended task of recipe generation. This study presents extensive experiments with multiple model architectures, ranging from T5-small (Raffel et al., 2023) and SmolLM-135M(Allal et al., 2024) to Phi-2 (Research, 2023), implementing both traditional NLP metrics and custom domain-specific evaluation metrics. Our novel evaluation framework incorporates recipe-specific metrics for assessing content quality and introduces approaches to allergen substitution. The results indicate that, while larger models generally perform better on standard metrics, the relationship between model size and recipe quality is more nuanced when considering domain-specific metrics.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsFocus