Text-guided diffusion-based restoration of extremely compressed backgrounds for VCMopen access
- Authors
- Le Thi Hue Dao; Yang, Naeun; Lee, Jooyoung; Jeong, Seyoon; Lee, Chul
- Issue Date
- Apr-2026
- Publisher
- 한국통신학회
- Keywords
- Diffusion model; Image generation; Image restoration; Video coding for machines (VCM)
- Citation
- ICT Express, v.12, no.2, pp 487 - 492
- Pages
- 6
- Indexed
- SCIE
SCOPUS
KCI
- Journal Title
- ICT Express
- Volume
- 12
- Number
- 2
- Start Page
- 487
- End Page
- 492
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/63737
- DOI
- 10.1016/j.icte.2026.01.011
- ISSN
- 2405-9595
2405-9595
- Abstract
- Restoring high-quality images from severely degraded inputs is essential for video coding for machines (VCM), where background regions are compressed at extremely low bitrates. In this letter, we propose a novel text-guided diffusion-based restoration (TGDR) algorithm, which integrates semantic information from text captions to guide the restoration process. Specifically, we develop a refinement block that incorporates a transformer-based time-aware feature extractor to fuse visual features, time-step embeddings, and textual semantics adaptively to guide a pretrained diffusion model during the reverse denoising process. By incorporating both visual and textual information, TGDR effectively reconstructs complex structures and improves semantic consistency in highly compressed regions. Experimental results show that TGDR achieves superior performance compared to state-of-the-art algorithms. © 2026 The Authors.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - ETC > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.