Cited 0 time in
BDA: Bi-directional attention for zero-shot learning
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Lee, Junseok | - |
| dc.contributor.author | Cao, Jinming | - |
| dc.contributor.author | Yin, Yifang | - |
| dc.contributor.author | Kim, Jihie | - |
| dc.contributor.author | Zimmermann, Roger | - |
| dc.contributor.author | Park, Seongsik | - |
| dc.date.accessioned | 2025-11-28T07:30:51Z | - |
| dc.date.available | 2025-11-28T07:30:51Z | - |
| dc.date.issued | 2025-10 | - |
| dc.identifier.issn | 2096-0433 | - |
| dc.identifier.issn | 2096-0662 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/62159 | - |
| dc.description.abstract | Zero-shot learning (ZSL) is an important and rapidly growing area of machine learning that aims to recognize new classes without prior training data. Despite its significance, ZSL has faced challenges with overfitting in embedding-based methods and limitations in traditional one-directional attention (ODA) based approaches. To bridge these gaps, this paper proposes the use of bi-directional attention (BDA) to integrate insights from both embedding and attention-based approaches. The proposed BDA system consists of a bi-directional attention network (BDAN) and a synthesized visual embedding network (SVEN) that facilitates visual-semantic interaction for ZSL classification. More specifically, the BDAN employs region self-attention (RSA), semantic synthesis attention (SSA), and visual synthesis attention (VSA) to overcome the overfitting issue in embedding methods and enhance transferability, to associate visual features with semantic property information, and to learn locally improved visual features. Extensive testing on CUB, SUN, and AWA2 datasets confirm the superiority of our proposed method over traditional approaches. © 2025 Elsevier B.V., All rights reserved. | - |
| dc.format.extent | 21 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Tsinghua University Press | - |
| dc.title | BDA: Bi-directional attention for zero-shot learning | - |
| dc.type | Article | - |
| dc.publisher.location | 중국 | - |
| dc.identifier.doi | 10.26599/CVM.2025.9450401 | - |
| dc.identifier.scopusid | 2-s2.0-105021021968 | - |
| dc.identifier.wosid | 001616145400011 | - |
| dc.identifier.bibliographicCitation | Computational Visual Media, v.11, no.5, pp 983 - 1003 | - |
| dc.citation.title | Computational Visual Media | - |
| dc.citation.volume | 11 | - |
| dc.citation.number | 5 | - |
| dc.citation.startPage | 983 | - |
| dc.citation.endPage | 1003 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Software Engineering | - |
| dc.subject.keywordAuthor | bi-directional attention (BDA) | - |
| dc.subject.keywordAuthor | interaction | - |
| dc.subject.keywordAuthor | transferability | - |
| dc.subject.keywordAuthor | zero-shot learning (ZSL) | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
