NutriBench: A Dataset for Evaluating Large Language Models on Nutrition Estimation from Meal Descriptions
Andong Hua, Mehak Preet Dhaliwal, Laya Pullela, Ryan Burke, Yao Qin

TL;DR
NutriBench is a new benchmark dataset for evaluating large language models on nutrition estimation from meal descriptions, enabling assessment of their accuracy and speed in health-related tasks.
Contribution
We introduce NutriBench, the first publicly available dataset for natural language meal nutrition estimation, and evaluate multiple LLMs on this benchmark with real-world relevance.
Findings
LLMs can estimate macro-nutrients with reasonable accuracy.
LLMs provide faster nutrition estimates than professionals.
Carbohydrate predictions impact blood glucose simulations.
Abstract
Accurate nutrition estimation helps people make informed dietary choices and is essential in the prevention of serious health complications. We present NutriBench, the first publicly available natural language meal description nutrition benchmark. NutriBench consists of 11,857 meal descriptions generated from real-world global dietary intake data. The data is human-verified and annotated with macro-nutrient labels, including carbohydrates, proteins, fats, and calories. We conduct an extensive evaluation of NutriBench on the task of carbohydrate estimation, testing twelve leading Large Language Models (LLMs), including GPT-4o, Llama3.1, Qwen2, Gemma2, and OpenBioLLM models, using standard, Chain-of-Thought and Retrieval-Augmented Generation strategies. Additionally, we present a study involving professional nutritionists, finding that LLMs can provide comparable but significantly faster…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCulinary Culture and Tourism · Biomedical Text Mining and Ontologies · Nutrition, Genetics, and Disease
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Residual Connection · Adam · Dropout · Byte Pair Encoding · Cosine Annealing · Layer Normalization · Linear Layer · Weight Decay
