# Accuracy of AI-Based Nutrient Estimation from Standardized Hospital Meal Images: A Comparison with Registered Dietitians

**Authors:** Tomomi Isobe, Lim Wan Zhang, Hana Murakami, Miyu Kadono, Megumi Aso, Atsuko Kayashita, Jun Kayashita

PMC · DOI: 10.3390/nu18060966 · Nutrients · 2026-03-18

## TL;DR

AI models like ChatGPT-4o and Gemini 1.5 Pro can accurately estimate calories and carbs from hospital meal images but overestimate fats, suggesting the need for human oversight and better data.

## Contribution

This study rigorously compares AI and dietitians in estimating nutrients from standardized meal images, revealing AI's overestimation of lipids.

## Key findings

- AI models and dietitians showed high accuracy for energy and carbohydrates (r > 0.8, ±10% range).
- AI models systematically overestimated lipids by more than 20%.
- Protein estimation accuracy was significantly lower for all AI models.

## Abstract

Background: Accurate dietary assessment is vital for preventing malnutrition in aging populations, particularly in home-care settings. Although Large Multimodal Models (LMMs) for nutrient estimation are evolving, their nutrient-specific accuracy requires rigorous validation. Methods: Fifteen standardized hospital meals were photographed under controlled conditions (90-degree angle, 500 lux). Ground truth values were determined by direct weighing. Estimates for energy and macronutrients were performed by 10 registered dietitians (RDs) and 10 AI models (including ChatGPT-4o and Gemini 1.5 Pro). Accuracy was assessed using Pearson’s correlation, Mean Absolute Error (MAE), and Bland–Altman analysis to quantify systematic bias. Results: For energy and carbohydrates, RDs and top-performing AI models (notably ChatGPT-4o and Gemini 1.5 Pro) demonstrated practical accuracy (r > 0.8, frequently within ±10% range). However, accuracy for protein and lipids was significantly lower across all AI models. Specifically, all AI models exhibited a substantial systematic overestimation of lipids (Mean Bias > +20%, p < 0.01), highlighting a critical “invisible nutrient” bias. Conclusions: Current AI tools show potential for caloric and carbohydrate monitoring but struggle with lipid and protein density. These findings emphasize the need for human–AI collaboration (“human-in-the-loop”) and the integration of cooking metadata to improve clinical utility in geriatric nutrition.

## Full-text entities

- **Diseases:** malnutrition (MESH:D044342)
- **Chemicals:** carbohydrate (MESH:D002241), lipid (MESH:D008055)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13029357/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13029357/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/PMC13029357/full.md

---
Source: https://tomesphere.com/paper/PMC13029357