TL;DR
This paper reveals that in diffusion models, the resolution of ambiguous concepts is primarily governed by self-attention layers, and introduces ICM, a targeted intervention method that improves debiasing performance.
Contribution
The study introduces a probing-based localization technique to identify key layers and proposes ICM, a precise intervention method for steering diffusion models.
Findings
Self-attention layers are crucial for resolving ambiguous concepts.
Targeted interventions on specific layers improve debiasing.
ICM outperforms existing methods in minimizing artifacts.
Abstract
Text-to-image diffusion models exhibit remarkable generative capabilities, yet their internal operations remain opaque, particularly when handling prompts that are not fully descriptive. In such scenarios, models must make implicit decisions to generate details not explicitly specified in the text. This work investigates the hypothesis that this decision-making process is not diffuse but is computationally localized within the model's architecture. While existing localization techniques focus on prompt-related interventions, we notice that such explicit conditioning may differ from implicit decisions. Therefore, we introduce a probing-based localization technique to identify the layers with the highest attribute separability for concepts. Our findings indicate that the resolution of ambiguous concepts is governed principally by self-attention layers, identifying them as the most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
