ARChef: An iOS-Based Augmented Reality Cooking Assistant Powered by   Multimodal Gemini LLM

Rithik Vir; Parsa Madinei

arXiv:2412.00627·cs.HC·December 10, 2024

ARChef: An iOS-Based Augmented Reality Cooking Assistant Powered by Multimodal Gemini LLM

Rithik Vir, Parsa Madinei

PDF

Open Access

TL;DR

This paper introduces an iOS-based augmented reality cooking assistant that integrates multimodal large language models and computer vision to enhance meal planning, ingredient identification, and nutritional guidance, improving accessibility and user experience.

Contribution

It presents a novel AR cooking app leveraging Gemini LLM and ARKit, offering personalized meal suggestions and nutritional info, addressing limitations of existing AR cooking tools.

Findings

01

High user satisfaction in surveys

02

Effective ingredient recognition and recipe generation

03

Enhanced accessibility for diverse users

Abstract

Cooking meals can be difficult, causing many to resort to cookbooks and online recipes. However, relying on these traditional methods of cooking often results in missing ingredients, nutritional hazards, and unsatisfactory meals. Using Augmented Reality (AR) can address these issues; however, current AR cooking applications have poor user interfaces and limited accessibility. This paper proposes a prototype of an iOS application that integrates AR and Computer Vision (CV) into the cooking process. We leverage Google's Gemini Large Language Model (LLM) to identify ingredients in the camera's field of vision and generate recipe choices with detailed nutritional information. Additionally, this application uses Apple's ARKit to create an AR user interface compatible with iOS devices. Users can personalize their meal suggestions by inputting their dietary preferences and rating each meal.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImpulse Buying and Technology Impacts · Augmented Reality Applications