Comprehensive Evaluation of Large Multimodal Models for Nutrition Analysis: A New Benchmark Enriched with Contextual Metadata
Bruce Coburn, Jiangpeng He, Megan E. Rollo, Satvinder S. Dhaliwal, Deborah A. Kerr, Fengqing Zhu

TL;DR
This paper evaluates large multimodal models for nutrition analysis, demonstrating that integrating contextual metadata like location and time significantly improves prediction accuracy, and introduces a new dataset for benchmarking.
Contribution
It introduces ACETADA, a new publicly available dataset, and systematically assesses how contextual metadata enhances LMM performance in nutrition estimation tasks.
Findings
Metadata integration reduces MAE and MAPE in nutritional predictions.
Context-aware prompting strategies outperform image-only prompts.
Incorporating reasoning modifiers further improves model accuracy.
Abstract
Large Multimodal Models (LMMs) are increasingly applied to meal images for nutrition analysis. However, existing work primarily evaluates proprietary models, such as GPT-4. This leaves the broad range of LLMs underexplored. Additionally, the influence of integrating contextual metadata and its interaction with various reasoning modifiers remains largely uncharted. This work investigates how interpreting contextual metadata derived from GPS coordinates (converted to location/venue type), timestamps (transformed into meal/day type), and the food items present can enhance LMM performance in estimating key nutritional values. These values include calories, macronutrients (protein, carbohydrates, fat), and portion sizes. We also introduce \textbf{ACETADA}, a new food-image dataset slated for public release. This open dataset provides nutrition information verified by the dietitian and serves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNutritional Studies and Diet · Biomedical Text Mining and Ontologies
