Cited 0 time in
Text-guided diffusion-based restoration of extremely compressed backgrounds for VCM
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Le Thi Hue Dao | - |
| dc.contributor.author | Yang, Naeun | - |
| dc.contributor.author | Lee, Jooyoung | - |
| dc.contributor.author | Jeong, Seyoon | - |
| dc.contributor.author | Lee, Chul | - |
| dc.date.accessioned | 2026-02-19T06:00:20Z | - |
| dc.date.available | 2026-02-19T06:00:20Z | - |
| dc.date.issued | 2026-04 | - |
| dc.identifier.issn | 2405-9595 | - |
| dc.identifier.issn | 2405-9595 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/63737 | - |
| dc.description.abstract | Restoring high-quality images from severely degraded inputs is essential for video coding for machines (VCM), where background regions are compressed at extremely low bitrates. In this letter, we propose a novel text-guided diffusion-based restoration (TGDR) algorithm, which integrates semantic information from text captions to guide the restoration process. Specifically, we develop a refinement block that incorporates a transformer-based time-aware feature extractor to fuse visual features, time-step embeddings, and textual semantics adaptively to guide a pretrained diffusion model during the reverse denoising process. By incorporating both visual and textual information, TGDR effectively reconstructs complex structures and improves semantic consistency in highly compressed regions. Experimental results show that TGDR achieves superior performance compared to state-of-the-art algorithms. © 2026 The Authors. | - |
| dc.format.extent | 6 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | 한국통신학회 | - |
| dc.title | Text-guided diffusion-based restoration of extremely compressed backgrounds for VCM | - |
| dc.type | Article | - |
| dc.publisher.location | 대한민국 | - |
| dc.identifier.doi | 10.1016/j.icte.2026.01.011 | - |
| dc.identifier.scopusid | 2-s2.0-105029086915 | - |
| dc.identifier.bibliographicCitation | ICT Express, v.12, no.2, pp 487 - 492 | - |
| dc.citation.title | ICT Express | - |
| dc.citation.volume | 12 | - |
| dc.citation.number | 2 | - |
| dc.citation.startPage | 487 | - |
| dc.citation.endPage | 492 | - |
| dc.type.docType | Article in press | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.description.journalRegisteredClass | kci | - |
| dc.subject.keywordAuthor | Diffusion model | - |
| dc.subject.keywordAuthor | Image generation | - |
| dc.subject.keywordAuthor | Image restoration | - |
| dc.subject.keywordAuthor | Video coding for machines (VCM) | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
