Detailed Information

Cited 0 time in webofscience Cited 1 time in scopus
Metadata Downloads

CIC: A Framework for Culturally-Aware Image Captioning

Full metadata record
DC Field Value Language
dc.contributor.authorYun, Youngsik-
dc.contributor.authorKim, Jihie-
dc.date.accessioned2024-10-14T05:00:10Z-
dc.date.available2024-10-14T05:00:10Z-
dc.date.issued2024-08-
dc.identifier.issn1045-0823-
dc.identifier.urihttps://scholarworks.dongguk.edu/handle/sw.dongguk/26434-
dc.description.abstractImage Captioning generates descriptive sentences from images using Vision-Language Pre-trained models (VLPs) such as BLIP, which has improved greatly.However, current methods lack the generation of detailed descriptive captions for the cultural elements depicted in the images, such as the traditional clothing worn by people from Asian cultural groups.In this paper, we propose a new framework, Culturally-aware Image Captioning (CIC), that generates captions and describes cultural elements extracted from cultural visual elements in images representing cultures.Inspired by methods combining visual modality and Large Language Models (LLMs) through appropriate prompts, our framework (1) generates questions based on cultural categories from images, (2) extracts cultural visual elements from Visual Question Answering (VQA) using generated questions, and (3) generates culturally-aware captions using LLMs with the prompts.Our human evaluation conducted on 45 participants from 4 different cultural groups with a high understanding of the corresponding culture shows that our proposed framework generates more culturally descriptive captions when compared to the image captioning baseline based on VLPs.Resources can be found at https://shane3606.github.io/cic. © 2024 International Joint Conferences on Artificial Intelligence. All rights reserved.-
dc.format.extent9-
dc.language영어-
dc.language.isoENG-
dc.publisherInternational Joint Conferences on Artificial Intelligence-
dc.titleCIC: A Framework for Culturally-Aware Image Captioning-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.24963/ijcai.2024/180-
dc.identifier.scopusid2-s2.0-85204308434-
dc.identifier.wosid001347142801082-
dc.identifier.bibliographicCitationIJCAI International Joint Conference on Artificial Intelligence, pp 1625 - 1633-
dc.citation.titleIJCAI International Joint Conference on Artificial Intelligence-
dc.citation.startPage1625-
dc.citation.endPage1633-
dc.type.docTypeProceedings Paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaMathematics-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryComputer Science, Theory & Methods-
dc.relation.journalWebOfScienceCategoryMathematics, Applied-
dc.subject.keywordAuthorEconomic And Social Effects-
dc.subject.keywordAuthorModeling Languages-
dc.subject.keywordAuthor'current-
dc.subject.keywordAuthorCultural Groups-
dc.subject.keywordAuthorHuman Evaluation-
dc.subject.keywordAuthorImage Captioning-
dc.subject.keywordAuthorLanguage Model-
dc.subject.keywordAuthorQuestion Answering-
dc.subject.keywordAuthorVisual Elements-
dc.subject.keywordAuthorVisual Modalities-
dc.subject.keywordAuthorVisual Languages-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Ji Hie photo

Kim, Ji Hie
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE