Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

ROM-Pose: restoring occluded mask image for 2D human pose estimation

Full metadata record
DC Field Value Language
dc.contributor.authorLee, Yunju-
dc.contributor.authorKim, Jihie-
dc.date.accessioned2025-06-12T05:41:50Z-
dc.date.available2025-06-12T05:41:50Z-
dc.date.issued2025-05-
dc.identifier.issn2376-5992-
dc.identifier.issn2376-5992-
dc.identifier.urihttps://scholarworks.dongguk.edu/handle/sw.dongguk/58430-
dc.description.abstractHuman pose estimation (HPE) is a field focused on estimating human poses by detecting key points in images. HPE includes methods like top-down and bottom-up approaches. The top-down approach uses a two-stage process, first locating and then detecting key points on humans with bounding boxes, whereas the bottom-up approach directly detects individual key points and integrates them to estimate the overall pose. In this article, we address the problem of bounding box detection inaccuracies in certain situations using the top-down method. The detected bounding boxes, which serve as input for the model, impact the accuracy of pose estimation. Occlusions occur when a part of the target's body is obscured by a person or object and hinder the model's ability to detect complete bounding boxes. Consequently, the model produces bounding boxes that do not recognize occluded parts, resulting in their exclusion from the input used by the HPE model. To mitigate this issue, we introduce the Restoring Occluded Mask Image for 2D Human Pose Estimation (ROM-Pose), comprising a restoration model and an HPE model. The restoration model is designed to delineate the boundary between the target's grayscale mask (occluded image) and the blocker's grayscale mask (occludee image) using the specially created Whole Common Objects in Context (COCO) dataset. Upon identifying the boundary, the restoration model restores the occluded image. This restored image is subsequently overlaid onto the RGB image for use in the HPE model. By integrating occluded parts' information into the input, the bounding box includes these areas during detection, thus enhancing the HPE model's ability to recognize them. ROM-Pose achieved a 1.6% improvement in average precision (AP) compared to the baseline.-
dc.language영어-
dc.language.isoENG-
dc.publisherPEERJ INC-
dc.titleROM-Pose: restoring occluded mask image for 2D human pose estimation-
dc.typeArticle-
dc.publisher.location영국-
dc.identifier.doi10.7717/peerj-cs.2843-
dc.identifier.scopusid2-s2.0-105005184174-
dc.identifier.wosid001488650500001-
dc.identifier.bibliographicCitationPeerJ Computer Science, v.11-
dc.citation.titlePeerJ Computer Science-
dc.citation.volume11-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryComputer Science, Theory & Methods-
dc.subject.keywordAuthorHuman pose estimation-
dc.subject.keywordAuthorEstimation-
dc.subject.keywordAuthorSegmentation-
dc.subject.keywordAuthorRestoration-
dc.subject.keywordAuthorAmodal instance segmentation-
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Ji Hie photo

Kim, Ji Hie
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE