Cited 0 time in
Image-Text Embedding with Hierarchical Knowledge for Cross-Modal Retrieval
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Seo, Sanghyun | - |
| dc.contributor.author | Kim, Juntae | - |
| dc.date.accessioned | 2023-04-28T10:41:21Z | - |
| dc.date.available | 2023-04-28T10:41:21Z | - |
| dc.date.issued | 2018-12-08 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/10012 | - |
| dc.description.abstract | Heterogeneous data embedding is a process of mapping different kinds of data into a common vector space of a certain dimension. Image-text embedding also means mapping image and text data that have completely different characteristics into a common vector space. In this paper, we propose an image-text embedding method using hierarchical knowledge such as coarse and fine labels of text data. The proposed method improves the training efficiency of the embedding model by fixing the coarse label vectors. In addition, the loss function is designed by arbitrarily selecting the negative sample from the fine labels having a hierarchical relationship with the coarse label, so that the difference between the vectors of the fine labels which have same coarse label becomes larger. So, when the images that are visual data is mapped into a common vector space, the semantic of images becomes clear. Experimental results show that embedding with hierarchical knowledge has been successfully performed using the proposed methodology and that cross-modal retrieval can be efficiently performed through embedding model. | - |
| dc.format.extent | 4 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | ASSOC COMPUTING MACHINERY | - |
| dc.title | Image-Text Embedding with Hierarchical Knowledge for Cross-Modal Retrieval | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1145/3297156.3297244 | - |
| dc.identifier.scopusid | 2-s2.0-85062768220 | - |
| dc.identifier.wosid | 000469786300067 | - |
| dc.identifier.bibliographicCitation | PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), pp 350 - 353 | - |
| dc.citation.title | PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018) | - |
| dc.citation.startPage | 350 | - |
| dc.citation.endPage | 353 | - |
| dc.type.docType | Proceedings Paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | - |
| dc.subject.keywordAuthor | Heterogeneous Data Embedding | - |
| dc.subject.keywordAuthor | Image Text Embedding | - |
| dc.subject.keywordAuthor | Hierarchical Knowledge | - |
| dc.subject.keywordAuthor | Cross-modal Retrieval | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
