An Improved Diffusion Model for Generating Images of a Single Category of Food on a Small Dataset
Zitian Chen, Zhiyong Xiao, Dinghui Wu, Qingbing Sang

TL;DR
This paper introduces a new diffusion model that generates high-quality food images using limited data, improving food classification accuracy through synthetic augmentation.
Contribution
The novel Ingredient-Aware Diffusion Model with LIE and CA mechanisms enables high-fidelity food image synthesis in data-scarce scenarios.
Findings
The proposed model achieves state-of-the-art generation quality on Food-101 and VireoFood-172 datasets.
Using synthetic images for data augmentation improved downstream food classification accuracy from 95.65% to 96.20%.
The model's linear interpolation strategy stabilizes training with limited data samples.
Abstract
In the era of the digital food economy, high-fidelity food images are critical for applications ranging from visual e-commerce presentation to automated dietary assessment. However, developing robust computer vision systems for food analysis is often hindered by data scarcity for long-tail or regional dishes. To address this challenge, we propose a novel high-fidelity food image synthesis framework as an effective data augmentation tool. Unlike generic generative models, our method introduces an Ingredient-Aware Diffusion Model based on the Masked Diffusion Transformer (MaskDiT) architecture. Specifically, we design a Label and Ingredients Encoding (LIE) module and a Cross-Attention (CA) mechanism to explicitly model the relationship between food composition and visual appearance, simulating the “cooking” process digitally. Furthermore, to stabilize training on limited data samples, we…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNutritional Studies and Diet · Generative Adversarial Networks and Image Synthesis · Agriculture Sustainability and Environmental Impact
