# Automatic Image Recognition Meal Reporting Among Young Adults: Randomized Controlled Trial

**Authors:** Prasan Kumar Sahoo, Sherry Yueh-Hsia Chiu, Yu-Sheng Lin, Chien-Hung Chen, Denisa Irianti, Hsin-Yun Chen, Mekhla Sarkar, Ying-Chieh Liu

PMC · DOI: 10.2196/60070 · JMIR mHealth and uHealth · 2025-08-14

## TL;DR

A new app using AI to automatically recognize food in images was tested against a voice input app, showing better accuracy and speed in meal reporting among young adults.

## Contribution

The study introduces an AI-powered app for automatic food recognition in real-time meal reporting and compares it to a voice input method.

## Key findings

- The AI image recognition app (AIR) correctly identified 86% of dishes, significantly outperforming the voice input app (VIR) at 68%.
- AIR users completed food reporting faster than VIR users, with a significant difference in time efficiency.
- Both apps were perceived as highly usable and learnable, with no significant difference in user perception scores.

## Abstract

Advances in artificial intelligence technology have raised new possibilities for the effective evaluation of daily dietary intake, but more empirical study is needed for the use of such technologies under realistic meal scenarios. This study developed an automated food recognition technology, which was then integrated into its previous design to improve usability for meal reporting. The newly developed app allowed for the automatic detection and recognition of multiple dishes within a single real-time food image as input. App performance was tested using young adults in authentic dining conditions.

A 2-group comparative study was conducted to assess app performance using metrics including accuracy, efficiency, and user perception. The experimental group, named the automatic image-based reporting (AIR) group, was compared against a control group using the previous version, named the voice input reporting (VIR) group. Each application is primarily designed to facilitate a distinct method of food intake reporting. AIR users capture and upload images of their selected dishes, supplemented with voice commands where appropriate. VIR users supplement the uploaded image with verbal inputs for food names and attributes.

The 2 mobile apps were subjected to a head-to-head parallel randomized evaluation. A cohort of 42 young adults aged 20‐25 years (9 male and 33 female participants) was recruited from a university in Taiwan and randomly assigned to 2 groups, that is, AIR (n=22) and VIR (n=20). Both groups were assessed using the same menu of 17 dishes. Each meal was designed to represent a typical lunch or dinner setting, with 1 staple, 1 main course, and 3 side dishes. All participants used the app on the same type of smartphone, with the interfaces of both using uniform user interactions, icons, and layouts. Analysis of the gathered data focused on assessing reporting accuracy, time efficiency, and user perception.

For the AIR group, 86% (189/220) of dishes were correctly identified, whereas 68% (136/200) of dishes were accurately reported. The AIR group exhibited a significantly higher degree of identification accuracy compared to the VIR group (P<.001). The AIR group also required significantly less time to complete food reporting (P<.001). System usability scale scores showed both apps were perceived as having high usability and learnability (P=.20).

The AIR group outperformed the VIR group concerning accuracy and time efficiency for overall dish reporting within the meal testing scenario. While further technological enhancement may be required, artificial intelligence vision technology integration into existing mobile apps holds promise. Our results provide evidence-based contributions to the integration of automatic image recognition technology into existing apps in terms of user interaction efficacy and overall ease of use. Further empirical work is required, including full-scale randomized controlled trials and assessments of user perception under various conditions.

## Full-text entities

- **Diseases:** AIR (MESH:C564543), chronic diseases (MESH:D002908), weight loss (MESH:D015431), COVID-19 (MESH:D000086382), obesity (MESH:D009765), overweight (MESH:D050177)
- **Chemicals:** SUS (-)
- **Species:** Homo sapiens (human, species) [taxon 9606], Momordica charantia (balsam pear, species) [taxon 3673], Solanum lycopersicum (tomato, species) [taxon 4081], Daucus carota (carrot, species) [taxon 4039], Oryza sativa (Asian cultivated rice, species) [taxon 4530], Brassica oleracea var. italica (asparagus broccoli, varietas) [taxon 36774], Brassica oleracea var. botrytis (cauliflower, varietas) [taxon 3715]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12352700/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12352700/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/PMC12352700/full.md

---
Source: https://tomesphere.com/paper/PMC12352700