Heterogeneous Feature Fusion Module Based on CNN and Transformer for Multiview Stereo Reconstructionopen access
- Authors
- Gao, Rui; Xu, Jiajia; Chen, Yipeng; Cho, Kyungeun
- Issue Date
- Jan-2023
- Publisher
- MDPI
- Keywords
- multi-view stereo; 3D reconstruction; deep learning; transformer
- Citation
- Mathematics, v.11, no.1, pp 1 - 14
- Pages
- 14
- Indexed
- SCIE
SCOPUS
- Journal Title
- Mathematics
- Volume
- 11
- Number
- 1
- Start Page
- 1
- End Page
- 14
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/19197
- DOI
- 10.3390/math11010112
- ISSN
- 2227-7390
2227-7390
- Abstract
- For decades, a vital area of computer vision research has been multiview stereo (MVS), which creates 3D models of a scene using photographs. This study presents an effective MVS network for 3D reconstruction utilizing multiview pictures. Alternative learning-based reconstruction techniques work well, because CNNs (convolutional neural network) can extract only the image's local features; however, they contain many artifacts. Herein, a transformer and CNN are used to extract the global and local features of the image, respectively. Additionally, hierarchical aggregation and heterogeneous interaction modules were used to improve these features. They are based on the transformer and can extract dense features with 3D consistency and global context that are necessary to provide accurate matching for MVS.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.