DiffusionPID: Interpreting Diffusion via Partial Information Decomposition
Rushikesh Zawar, Shaurya Dewan, Prakanshul Saxena, Yingshan Chang,, Andrew Luo, Yonatan Bisk

TL;DR
DiffusionPID introduces an information-theoretic method to analyze how text prompts influence diffusion models, revealing insights into concept localization, biases, and word ambiguity, thereby enhancing understanding and interpretability of these models.
Contribution
The paper presents DiffusionPID, a novel application of Partial Information Decomposition to interpret and diagnose text-to-image diffusion models at a granular level.
Findings
Identifies how individual tokens influence generated images.
Recovers gender and ethnicity biases in diffusion models.
Characterizes word ambiguity and similarity from the model's perspective.
Abstract
Text-to-image diffusion models have made significant progress in generating naturalistic images from textual inputs, and demonstrate the capacity to learn and represent complex visual-semantic relationships. While these diffusion models have achieved remarkable success, the underlying mechanisms driving their performance are not yet fully accounted for, with many unanswered questions surrounding what they learn, how they represent visual-semantic relationships, and why they sometimes fail to generalize. Our work presents Diffusion Partial Information Decomposition (DiffusionPID), a novel technique that applies information-theoretic principles to decompose the input text prompt into its elementary components, enabling a detailed examination of how individual tokens and their interactions shape the generated image. We introduce a formal approach to analyze the uniqueness, redundancy, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsDiffusion
