Switchable-Encoder-Based Self-Supervised Learning Framework for Monocular Depth and Pose Estimation

Kim, Junoh; Gao, Rui; Park, Jisun; Yoon, Jinsoo; Cho, Kyungeun

Detailed Information

Cited 2 time in webofscience

Cited 2 time in scopus

Metadata Downloads

Switchable-Encoder-Based Self-Supervised Learning Framework for Monocular Depth and Pose Estimationopen access

Authors: Kim, Junoh; Gao, Rui; Park, Jisun; Yoon, Jinsoo; Cho, Kyungeun

Issue Date: Dec-2023

Publisher: MDPI

Keywords: monocular depth estimation; self-supervised learning; structure from motion

Citation: Remote Sensing, v.15, no.24, pp 1 - 25

Pages: 25

Indexed: SCIE
SCOPUS

Journal Title: Remote Sensing

Volume: 15

Number: 24

Start Page: 1

End Page: 25

URI: https://scholarworks.dongguk.edu/handle/sw.dongguk/22718

DOI: 10.3390/rs15245739

ISSN: 2072-4292
2072-4292

Abstract: Monocular depth prediction research is essential for expanding meaning from 2D to 3D. Recent studies have focused on the application of a newly proposed encoder; however, the development within the self-supervised learning framework remains unexplored, an aspect critical for advancing foundational models of 3D semantic interpretation. Addressing the dynamic nature of encoder-based research, especially in performance evaluations for feature extraction and pre-trained models, this research proposes the switchable encoder learning framework (SELF). SELF enhances versatility by enabling the seamless integration of diverse encoders in a self-supervised learning context for depth prediction. This integration is realized through the direct transfer of feature information from the encoder and by standardizing the input structure of the decoder to accommodate various encoder architectures. Furthermore, the framework is extended and incorporated into an adaptable decoder for depth prediction and camera pose learning, employing standard loss functions. Comparative experiments with previous frameworks using the same encoder reveal that SELF achieves a 7% reduction in parameters while enhancing performance. Remarkably, substituting newly proposed algorithms in place of an encoder improves the outcomes as well as significantly decreases the number of parameters by 23%. The experimental findings highlight the ability of SELF to broaden depth factors, such as depth consistency. This framework facilitates the objective selection of algorithms as a backbone for extended research in monocular depth prediction. © 2023 by the authors.

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Cho, Kyung Eun photo

Cho, Kyung Eun: College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE