I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
Junseo Park, Hyeryung Jang

TL;DR
This paper presents I2AM, a novel method for visualizing bidirectional attribution maps in image-to-image diffusion models, improving interpretability and aiding model debugging across various tasks.
Contribution
Introduces I2AM, a new approach for interpreting I2I diffusion models by visualizing cross-attention-based attribution maps and proposing the IMACS metric for evaluation.
Findings
I2AM effectively highlights key regions influencing output generation.
IMACS correlates strongly with existing performance metrics.
I2AM facilitates model debugging and refinement.
Abstract
Large-scale diffusion models have made significant advances in image generation, particularly through cross-attention mechanisms. While cross-attention has been well-studied in text-to-image tasks, their interpretability in image-to-image (I2I) diffusion models remains underexplored. This paper introduces Image-to-Image Attribution Maps (I2AM), a method that enhances the interpretability of I2I models by visualizing bidirectional attribution maps, from the reference image to the generated image and vice versa. I2AM aggregates cross-attention scores across time steps, attention heads, and layers, offering insights into how critical features are transferred between images. We demonstrate the effectiveness of I2AM across object detection, inpainting, and super-resolution tasks. Our results demonstrate that I2AM successfully identifies key regions responsible for generating the output, even…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Image Retrieval and Classification Techniques · Radiomics and Machine Learning in Medical Imaging
MethodsSoftmax · Attention Is All You Need · Diffusion · Inpainting
