Uncertainty-based Visual Question Answering: Estimating Semantic Inconsistency between Image and Knowledge Base
- Authors
- Chae, Jinyeong; Kim, Jihie
- Issue Date
- Jul-2022
- Publisher
- IEEE
- Keywords
- knowledge graph; knowledge-based visual question answering; semantic inconsistency; uncertainty
- Citation
- 2022 International Joint Conference on Neural Networks (IJCNN), v.2022-July
- Indexed
- SCOPUS
- Journal Title
- 2022 International Joint Conference on Neural Networks (IJCNN)
- Volume
- 2022-July
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/3833
- DOI
- 10.1109/IJCNN55064.2022.9892787
- ISSN
- 2161-4393
2161-4407
- Abstract
- Knowledge-based visual question answering (KVQA) task aims to answer questions that require additional external knowledge as well as an understanding of images and questions. Recent studies on KVQA inject an external knowledge in a multi-modal form, and as more knowledge is used, irrelevant information may be added and can confuse the question answering. In order to properly use the knowledge, this study proposes the following: 1) we introduce a novel semantic inconsistency measure computed from caption uncertainty and semantic similarity; 2) we suggest a new external knowledge assimilation method based on the semantic inconsistency measure and apply it to integrate explicit knowledge and implicit knowledge for KVQA; 3) the proposed method is evaluated with the OK-VQA dataset and achieves the state-of-the-art performance. © 2022 IEEE.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.