CIC: A Framework for Culturally-Aware Image Captioning

Yun, Youngsik; Kim, Jihie


Cited 0 times in Web of Science

Cited 1 time in Scopus


Full metadata record

DC Field	Value	Language
dc.contributor.author	Yun, Youngsik	-
dc.contributor.author	Kim, Jihie	-
dc.date.accessioned	2024-10-14T05:00:10Z	-
dc.date.available	2024-10-14T05:00:10Z	-
dc.date.issued	2024-08	-
dc.identifier.issn	1045-0823	-
dc.identifier.uri	https://scholarworks.dongguk.edu/handle/sw.dongguk/26434	-
dc.description.abstract	Image Captioning generates descriptive sentences from images using Vision-Language Pre-trained models (VLPs) such as BLIP, which has improved greatly. However, current methods lack the generation of detailed descriptive captions for the cultural elements depicted in the images, such as the traditional clothing worn by people from Asian cultural groups. In this paper, we propose a new framework, Culturally-aware Image Captioning (CIC), that generates captions and describes cultural elements extracted from cultural visual elements in images representing cultures. Inspired by methods combining visual modality and Large Language Models (LLMs) through appropriate prompts, our framework (1) generates questions based on cultural categories from images, (2) extracts cultural visual elements from Visual Question Answering (VQA) using generated questions, and (3) generates culturally-aware captions using LLMs with the prompts. Our human evaluation conducted on 45 participants from 4 different cultural groups with a high understanding of the corresponding culture shows that our proposed framework generates more culturally descriptive captions when compared to the image captioning baseline based on VLPs. Resources can be found at https://shane3606.github.io/cic. © 2024 International Joint Conferences on Artificial Intelligence. All rights reserved.	-
dc.format.extent	9	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	International Joint Conferences on Artificial Intelligence	-
dc.title	CIC: A Framework for Culturally-Aware Image Captioning	-
dc.type	Article	-
dc.publisher.location	United States	-
dc.identifier.doi	10.24963/ijcai.2024/180	-
dc.identifier.scopusid	2-s2.0-85204308434	-
dc.identifier.wosid	001347142801082	-
dc.identifier.bibliographicCitation	IJCAI International Joint Conference on Artificial Intelligence, pp 1625 - 1633	-
dc.citation.title	IJCAI International Joint Conference on Artificial Intelligence	-
dc.citation.startPage	1625	-
dc.citation.endPage	1633	-
dc.type.docType	Proceedings Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Mathematics	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Computer Science, Theory & Methods	-
dc.relation.journalWebOfScienceCategory	Mathematics, Applied	-
dc.subject.keywordAuthor	Economic And Social Effects	-
dc.subject.keywordAuthor	Modeling Languages	-
dc.subject.keywordAuthor	'current	-
dc.subject.keywordAuthor	Cultural Groups	-
dc.subject.keywordAuthor	Human Evaluation	-
dc.subject.keywordAuthor	Image Captioning	-
dc.subject.keywordAuthor	Language Model	-
dc.subject.keywordAuthor	Question Answering	-
dc.subject.keywordAuthor	Visual Elements	-
dc.subject.keywordAuthor	Visual Modalities	-
dc.subject.keywordAuthor	Visual Languages	-
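
The three steps listed in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the category list, the question template, the prompt wording, and every function name are hypothetical, and the VQA model and LLM are passed in as plain callables so that no specific library API is assumed.

```python
from typing import Any, Callable, Dict, List

# Hypothetical categories; the abstract only refers to "cultural categories".
CATEGORIES: List[str] = ["clothing", "food", "architecture"]


def generate_questions(categories: List[str]) -> Dict[str, str]:
    """Step 1: build one question per cultural category (template is assumed)."""
    return {c: f"What {c} is shown in the image?" for c in categories}


def extract_elements(
    image: Any,
    questions: Dict[str, str],
    vqa: Callable[[Any, str], str],
) -> Dict[str, str]:
    """Step 2: ask the VQA model each question and keep non-empty answers."""
    answers = {c: vqa(image, q).strip() for c, q in questions.items()}
    return {c: a for c, a in answers.items() if a}


def build_prompt(elements: Dict[str, str]) -> str:
    """Assemble an LLM prompt from the extracted elements (wording is assumed)."""
    facts = "; ".join(f"{c}: {a}" for c, a in elements.items())
    return (
        f"Cultural elements: {facts}\n"
        "Write one image caption that describes these elements."
    )


def cic_caption(
    image: Any,
    vqa: Callable[[Any, str], str],
    llm: Callable[[str], str],
    categories: List[str] = CATEGORIES,
) -> str:
    """Step 3: pass the prompt to the LLM and return its caption."""
    questions = generate_questions(categories)
    elements = extract_elements(image, questions, vqa)
    return llm(build_prompt(elements))
```

Any VQA model and LLM can be plugged in by wrapping them in functions with the signatures `vqa(image, question) -> str` and `llm(prompt) -> str`.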

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles


Related Researcher


Kim, Ji Hie: College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)



Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
