Detailed Information

Cited 19 time in webofscience Cited 26 time in scopus
Metadata Downloads

Object Detection-Based Video Retargeting With Spatial-Temporal Consistency

Full metadata record
DC Field Value Language
dc.contributor.authorLee, Seung Joon-
dc.contributor.authorLee, Siyeong-
dc.contributor.authorCho, Sung In-
dc.contributor.authorKang, Suk-Ju-
dc.date.accessioned2023-04-27T20:40:46Z-
dc.date.available2023-04-27T20:40:46Z-
dc.date.issued2020-12-
dc.identifier.issn1051-8215-
dc.identifier.issn1558-2205-
dc.identifier.urihttps://scholarworks.dongguk.edu/handle/sw.dongguk/5850-
dc.description.abstractThis study proposes a video retargeting method using deep neural network-based object detection. First, the meaningful regions of the input video denoted by bounding boxes of the object detection are extracted. In this case, the area is defined considering the size and number of bounding boxes for objects detected. The bounding boxes of each frame image are considered as regions of interest (RoIs). Second, the Siamese object tracking network is used to address high computational complexity of the object detection network. By dividing the video into scenes, object detection is performed for the first frame image of each scene to obtain the first bounding box. Object tracking is performed for the next sequential frame image until a scene change is detected. Third, the image is resized in the horizontal direction to alter the aspect ratio of the image and obtain the 1D RoIs of the image by projecting bounding boxes in the vertical direction. Then, the proposed method computes the grid map from the 1D RoIs to calculate new coordinates of each column data of the image. Finally, the retargeted video is obtained by rearranging all retargeted frame images. Comparative experiments conducted with various benchmark methods show an average bidirectional similarity score of 1.92, which is higher than other conventional methods. The proposed method was stable and satisfied viewers without causing cognitive discomfort as conventional methods.-
dc.format.extent6-
dc.language영어-
dc.language.isoENG-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleObject Detection-Based Video Retargeting With Spatial-Temporal Consistency-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/TCSVT.2020.2981652-
dc.identifier.scopusid2-s2.0-85097778341-
dc.identifier.wosid000597751000004-
dc.identifier.bibliographicCitationIEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, v.30, no.12, pp 4434 - 4439-
dc.citation.titleIEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY-
dc.citation.volume30-
dc.citation.number12-
dc.citation.startPage4434-
dc.citation.endPage4439-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordAuthorObject detection-
dc.subject.keywordAuthorObject tracking-
dc.subject.keywordAuthorDistortion-
dc.subject.keywordAuthorIndexes-
dc.subject.keywordAuthorComputational complexity-
dc.subject.keywordAuthorImage sequences-
dc.subject.keywordAuthorOptimization-
dc.subject.keywordAuthorObject detection-
dc.subject.keywordAuthorobject tracking-
dc.subject.keywordAuthorvideo retargeting-
dc.subject.keywordAuthorconvolutional neural network-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE