Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Text-guided diffusion-based restoration of extremely compressed backgrounds for VCM

Full metadata record
DC Field Value Language
dc.contributor.authorLe Thi Hue Dao-
dc.contributor.authorYang, Naeun-
dc.contributor.authorLee, Jooyoung-
dc.contributor.authorJeong, Seyoon-
dc.contributor.authorLee, Chul-
dc.date.accessioned2026-02-19T06:00:20Z-
dc.date.available2026-02-19T06:00:20Z-
dc.date.issued2026-04-
dc.identifier.issn2405-9595-
dc.identifier.issn2405-9595-
dc.identifier.urihttps://scholarworks.dongguk.edu/handle/sw.dongguk/63737-
dc.description.abstractRestoring high-quality images from severely degraded inputs is essential for video coding for machines (VCM), where background regions are compressed at extremely low bitrates. In this letter, we propose a novel text-guided diffusion-based restoration (TGDR) algorithm, which integrates semantic information from text captions to guide the restoration process. Specifically, we develop a refinement block that incorporates a transformer-based time-aware feature extractor to fuse visual features, time-step embeddings, and textual semantics adaptively to guide a pretrained diffusion model during the reverse denoising process. By incorporating both visual and textual information, TGDR effectively reconstructs complex structures and improves semantic consistency in highly compressed regions. Experimental results show that TGDR achieves superior performance compared to state-of-the-art algorithms. © 2026 The Authors.-
dc.format.extent6-
dc.language영어-
dc.language.isoENG-
dc.publisher한국통신학회-
dc.titleText-guided diffusion-based restoration of extremely compressed backgrounds for VCM-
dc.typeArticle-
dc.publisher.location대한민국-
dc.identifier.doi10.1016/j.icte.2026.01.011-
dc.identifier.scopusid2-s2.0-105029086915-
dc.identifier.bibliographicCitationICT Express, v.12, no.2, pp 487 - 492-
dc.citation.titleICT Express-
dc.citation.volume12-
dc.citation.number2-
dc.citation.startPage487-
dc.citation.endPage492-
dc.type.docTypeArticle in press-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.description.journalRegisteredClasskci-
dc.subject.keywordAuthorDiffusion model-
dc.subject.keywordAuthorImage generation-
dc.subject.keywordAuthorImage restoration-
dc.subject.keywordAuthorVideo coding for machines (VCM)-
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Chul photo

Lee, Chul
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE