Understanding the Limitations of Diffusion Concept Algebra Through Food
E. Zhixuan Zeng, Yuhao Chen, Alexander Wong

TL;DR
This paper investigates the limitations of diffusion concept algebra in food imagery, revealing how models capture culinary diversity and where biases and weaknesses occur.
Contribution
It introduces a novel analysis of diffusion models in the food domain, highlighting their strengths and limitations through qualitative and quantitative methods.
Findings
Models effectively capture culinary diversity
Biases and limitations are identified in complex food compositions
Insights into model representation of food concepts
Abstract
Image generation techniques, particularly latent diffusion models, have exploded in popularity in recent years. Many techniques have been developed to manipulate and clarify the semantic concepts these large-scale models learn, offering crucial insights into biases and concept relationships. However, these techniques are often only validated in conventional realms of human or animal faces and artistic style transitions. The food domain offers unique challenges through complex compositions and regional biases, which can shed light on the limitations and opportunities within existing methods. Through the lens of food imagery, we analyze both qualitative and quantitative patterns within a concept traversal technique. We reveal measurable insights into the model's ability to capture and represent the nuances of culinary diversity, while also identifying areas where the model's biases and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Rough Sets and Fuzzy Logic
MethodsDiffusion
