Image-Text Embedding with Hierarchical Knowledge for Cross-Modal Retrieval

Seo, Sanghyun; Kim, Juntae

Detailed Information

Cited 2 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Image-Text Embedding with Hierarchical Knowledge for Cross-Modal Retrieval

Full metadata record

DC Field	Value	Language
dc.contributor.author	Seo, Sanghyun	-
dc.contributor.author	Kim, Juntae	-
dc.date.accessioned	2023-04-28T10:41:21Z	-
dc.date.available	2023-04-28T10:41:21Z	-
dc.date.issued	2018-12-08	-
dc.identifier.uri	https://scholarworks.dongguk.edu/handle/sw.dongguk/10012	-
dc.description.abstract	Heterogeneous data embedding is a process of mapping different kinds of data into a common vector space of a certain dimension. Image-text embedding also means mapping image and text data that have completely different characteristics into a common vector space. In this paper, we propose an image-text embedding method using hierarchical knowledge such as coarse and fine labels of text data. The proposed method improves the training efficiency of the embedding model by fixing the coarse label vectors. In addition, the loss function is designed by arbitrarily selecting the negative sample from the fine labels having a hierarchical relationship with the coarse label, so that the difference between the vectors of the fine labels which have same coarse label becomes larger. So, when the images that are visual data is mapped into a common vector space, the semantic of images becomes clear. Experimental results show that embedding with hierarchical knowledge has been successfully performed using the proposed methodology and that cross-modal retrieval can be efficiently performed through embedding model.	-
dc.format.extent	4	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	ASSOC COMPUTING MACHINERY	-
dc.title	Image-Text Embedding with Hierarchical Knowledge for Cross-Modal Retrieval	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1145/3297156.3297244	-
dc.identifier.scopusid	2-s2.0-85062768220	-
dc.identifier.wosid	000469786300067	-
dc.identifier.bibliographicCitation	PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), pp 350 - 353	-
dc.citation.title	PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018)	-
dc.citation.startPage	350	-
dc.citation.endPage	353	-
dc.type.docType	Proceedings Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Computer Science, Theory & Methods	-
dc.subject.keywordAuthor	Heterogeneous Data Embedding	-
dc.subject.keywordAuthor	Image Text Embedding	-
dc.subject.keywordAuthor	Hierarchical Knowledge	-
dc.subject.keywordAuthor	Cross-modal Retrieval	-

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Kim, Jun Tae photo

Kim, Jun Tae: College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE