Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

I2AM: INTERPRETING IMAGE-TO-IMAGE LATENT DIFFUSION MODELS VIA BI-ATTRIBUTION MAPS

Authors
Park, JunseoJang, Hyeryung
Issue Date
2025
Publisher
International Conference on Learning Representations, ICLR
Citation
13th International Conference on Learning Representations, ICLR 2025, pp 21596 - 21618
Pages
23
Indexed
FOREIGN
Journal Title
13th International Conference on Learning Representations, ICLR 2025
Start Page
21596
End Page
21618
URI
https://scholarworks.dongguk.edu/handle/sw.dongguk/58862
DOI
10.48550/arXiv.2407.12331
Abstract
Large-scale diffusion models have made significant advances in image generation, particularly through cross-attention mechanisms. While cross-attention has been well-studied in text-to-image tasks, their interpretability in image-to-image (I2I) diffusion models remains underexplored. This paper introduces Image-to-Image Attribution Maps (I2AM), a method that enhances the interpretability of I2I models by visualizing bidirectional attribution maps, from the reference image to the generated image and vice versa. I2AM aggregates cross-attention scores across time steps, attention heads, and layers, offering insights into how critical features are transferred between images. We demonstrate the effectiveness of I2AM across object detection, inpainting, and super-resolution tasks. Our results demonstrate that I2AM successfully identifies key regions responsible for generating the output, even in complex scenes. Additionally, we introduce the Inpainting Mask Attention Consistency Score (IMACS) as a novel evaluation metric to assess the alignment between attribution maps and inpainting masks, which correlates strongly with existing performance metrics. Through extensive experiments, we show that I2AM enables model debugging and refinement, providing practical tools for improving I2I model's performance and interpretability. © 2025 13th International Conference on Learning Representations, ICLR 2025. All rights reserved.
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Jang, Hye Ryung photo

Jang, Hye Ryung
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE