An End-to-end Food Portion Estimation Framework Based on Shape Reconstruction from Monocular Image
Zeman Shao, Gautham Vinod, Jiangpeng He, Fengqing Zhu

TL;DR
This paper introduces an end-to-end deep learning framework that estimates food energy from a single RGB image by reconstructing its 3D shape, improving accuracy while reducing user burden in dietary assessment.
Contribution
It presents a novel monocular image-based 3D shape reconstruction method for food energy estimation, eliminating the need for multi-view or depth data at inference.
Findings
Achieved MAE of 40.05 kCal on Nutrition5k dataset.
Attained MAPE of 11.47%, competitive with multi-view methods.
Uses only RGB images at inference, simplifying data collection.
Abstract
Dietary assessment is a key contributor to monitoring health status. Existing self-report methods are tedious and time-consuming with substantial biases and errors. Image-based food portion estimation aims to estimate food energy values directly from food images, showing great potential for automated dietary assessment solutions. Existing image-based methods either use a single-view image or incorporate multi-view images and depth information to estimate the food energy, which either has limited performance or creates user burdens. In this paper, we propose an end-to-end deep learning framework for food energy estimation from a monocular image through 3D shape reconstruction. We leverage a generative model to reconstruct the voxel representation of the food object from the input image to recover the missing 3D information. Our method is evaluated on a publicly available food image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNutritional Studies and Diet · Nutrition, Health and Food Behavior · Meat and Animal Product Quality
