Cited 0 time in
Culture-TRIP: Culturally-Aware Text-to-Image Generation with Iterative Prompt Refinement
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Jeong, Suchae | - |
| dc.contributor.author | Choi, Inseong | - |
| dc.contributor.author | Yun, Youngsik | - |
| dc.contributor.author | Kim, Jihie | - |
| dc.date.accessioned | 2026-01-30T03:00:23Z | - |
| dc.date.available | 2026-01-30T03:00:23Z | - |
| dc.date.issued | 2025-04 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/63533 | - |
| dc.description.abstract | Text-to-Image models, including Stable Diffusion, have significantly improved in generating images that are highly semantically aligned with the given prompts. However, existing models may fail to produce appropriate images for the cultural concepts or objects that are not well known or underrepresented in western cultures, such as 'hangari' (Korean utensil). In this paper, we propose a novel approach, Culturally-Aware Text-to-Image Generation with Iterative Prompt Refinement (Culture-TRIP), which refines the prompt in order to improve the alignment of the image with such culture nouns in text-to-image models. Our approach (1) retrieves cultural contexts and visual details related to the culture nouns in the prompt and (2) iteratively refines and evaluates the prompt based on a set of cultural criteria and large language models. The refinement process utilizes the information retrieved from Wikipedia and the Web. Our user survey, conducted with 66 participants from eight different countries demonstrates that our proposed approach enhances the alignment between the images and the prompts. In particular, C-TRIP demonstrates improved alignment between the generated images and underrepresented culture nouns. Resource can be found at https://shane3606.github.io/Culture-TRIP. © 2025 Association for Computational Linguistics. | - |
| dc.format.extent | 31 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Association for Computational Linguistics (ACL) | - |
| dc.title | Culture-TRIP: Culturally-Aware Text-to-Image Generation with Iterative Prompt Refinement | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.18653/v1/2025.naacl-long.483 | - |
| dc.identifier.scopusid | 2-s2.0-105027460766 | - |
| dc.identifier.wosid | 001611654000164 | - |
| dc.identifier.bibliographicCitation | Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, v.1, pp 9543 - 9573 | - |
| dc.citation.title | Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies | - |
| dc.citation.volume | 1 | - |
| dc.citation.startPage | 9543 | - |
| dc.citation.endPage | 9573 | - |
| dc.type.docType | Proceedings Paper | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | foreign | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Linguistics | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Interdisciplinary Applications | - |
| dc.relation.journalWebOfScienceCategory | Linguistics | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
