Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

A Survey of Training-free Diffusion-based Image Generation with Free-form Mask

Full metadata record
DC Field Value Language
dc.contributor.authorPark, Yoonseo-
dc.contributor.authorJo, Hyeongseob-
dc.contributor.authorCho, Sung In-
dc.date.accessioned2025-09-25T06:30:13Z-
dc.date.available2025-09-25T06:30:13Z-
dc.date.issued2025-
dc.identifier.issn2997-7401-
dc.identifier.issn2997-741X-
dc.identifier.urihttps://scholarworks.dongguk.edu/handle/sw.dongguk/61611-
dc.description.abstractLayout-to-image generation is a task that generates realistic images based on given layouts and corresponding textual descriptions. The layout provides structural information about the image, such as descriptions, positions, and sizes of objects. Traditional methods for layout-to-image generation relied on bounding boxes, which represent only fixed-form layouts. Recently, approaches using free-form masks have gained attention, as they enable more flexible control over the shapes and positions of objects. Among these, training-free methods have been proposed that leverage pre-trained diffusion models without additional training. These methods adjust modified attention and guidance mechanisms to steer the image generation process during the inference phase of the diffusion model. In this paper, we review training-free diffusion-based image generation methods that utilize free-form masks. We focus on three representative methods: Paint-with-Words, MultiDiffusion, and Zero-Painter. We analyze their generation strategies and key mechanisms, as well as their limitations regarding spatial accuracy and consistency in object placement. © 2025 IEEE.-
dc.language영어-
dc.language.isoENG-
dc.publisherIEEE-
dc.titleA Survey of Training-free Diffusion-based Image Generation with Free-form Mask-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/ITC-CSCC66376.2025.11137628-
dc.identifier.scopusid2-s2.0-105016391894-
dc.identifier.bibliographicCitation2025 International Technical Conference on Circuits/Systems, Computers, and Communications-
dc.citation.title2025 International Technical Conference on Circuits/Systems, Computers, and Communications-
dc.type.docTypeConference paper-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassforeign-
dc.subject.keywordAuthorcross-attention-
dc.subject.keywordAuthordiffusion models-
dc.subject.keywordAuthorfree-form mask-
dc.subject.keywordAuthorlayout-to-image generation-
dc.subject.keywordAuthortraining-free-
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE