Pseudo-LiDAR With Two-Dimensional Instance for Monocular Three-Dimensional Object Tracking

Gao, Rui; Kim, Junoh; Chu, Phuong Minh; Cho, Kyungeun

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Pseudo-LiDAR With Two-Dimensional Instance for Monocular Three-Dimensional Object Tracking

Full metadata record

DC Field	Value	Language
dc.contributor.author	Gao, Rui	-
dc.contributor.author	Kim, Junoh	-
dc.contributor.author	Chu, Phuong Minh	-
dc.contributor.author	Cho, Kyungeun	-
dc.date.accessioned	2025-03-31T06:30:16Z	-
dc.date.available	2025-03-31T06:30:16Z	-
dc.date.issued	2025	-
dc.identifier.issn	2169-3536	-
dc.identifier.issn	2169-3536	-
dc.identifier.uri	https://scholarworks.dongguk.edu/handle/sw.dongguk/58046	-
dc.description.abstract	Establishing a framework capable of performing 3D object tracking is crucial for various applications, such as autonomous driving and robot navigation. Monocular cameras offer economic and flexible advantages over LiDAR sensors; however, monocular camera-based 3D object detection is fraught with uncertainty owing to difficulty in depth estimation, visual occlusion, and ambiguous object appearance. This uncertainty poses additional challenges to object tracking as well, making performing stable multiple-object tracking on monocular 3D detection results more complex and difficult. To address these challenges in monocular 3D multiple-object detection and tracking, we propose the innovative framework pseudo-LiDAR-MOT, which accurately infers complete 3D bounding box information from 2D image sequences captured by a monocular camera and efficiently correlates and tracks moving objects in the temporal dimension. This framework increases the object detection accuracy and tracking stability of objects in dynamic environments via the following three methods. Monocular depth prediction and 2D object detection networks are leveraged to estimate pixelwise depth and object boundaries. Subsequently, a pseudo-LiDAR approach is employed to generate a 3D point cloud for each object, where 3D point clouds are transformed into 2D representations. Furthermore, a novel 3D detection network based on image-based detection is introduced, simplifying the complexity of conventional 3D point-cloud-based detection while increasing the accuracy of monocular image-based 3D object detection. A specialized association network is designed to enhance object trajectory tracking across frames, thereby improving 3D tracking performance. The effectiveness of our approach was validated through extensive experiments conducted on the KITTI dataset. Our proposed framework exhibits excellent performance in 3D object detection and tracking, outperforming conventional methods. This study introduces innovative ideas for 3D object detection and tracking and establishes a basis for advancing autonomous driving systems.	-
dc.format.extent	13	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	IEEE	-
dc.title	Pseudo-LiDAR With Two-Dimensional Instance for Monocular Three-Dimensional Object Tracking	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1109/ACCESS.2025.3549790	-
dc.identifier.scopusid	2-s2.0-105001208282	-
dc.identifier.wosid	001447587500027	-
dc.identifier.bibliographicCitation	IEEE Access, v.13, pp 45771 - 45783	-
dc.citation.title	IEEE Access	-
dc.citation.volume	13	-
dc.citation.startPage	45771	-
dc.citation.endPage	45783	-
dc.type.docType	Article	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Telecommunications	-
dc.relation.journalWebOfScienceCategory	Computer Science, Information Systems	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalWebOfScienceCategory	Telecommunications	-
dc.subject.keywordAuthor	Three-dimensional displays	-
dc.subject.keywordAuthor	Object detection	-
dc.subject.keywordAuthor	Trajectory	-
dc.subject.keywordAuthor	Point cloud compression	-
dc.subject.keywordAuthor	Feature extraction	-
dc.subject.keywordAuthor	Accuracy	-
dc.subject.keywordAuthor	Laser radar	-
dc.subject.keywordAuthor	Tracking	-
dc.subject.keywordAuthor	Depth measurement	-
dc.subject.keywordAuthor	Target tracking	-
dc.subject.keywordAuthor	Object detection and tracking	-
dc.subject.keywordAuthor	deep learning	-
dc.subject.keywordAuthor	monocular 3D multiple-object tracking	-

Files in This Item: There are no files associated with this item.

Appears in Collections: ETC > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Cho, Kyung Eun photo

Cho, Kyung Eun: College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE