Text-guided diffusion-based restoration of extremely compressed backgrounds for VCM
  • Le Thi Hue Dao
  • Yang, Naeun
  • Lee, Jooyoung
  • Jeong, Seyoon
  • Lee, Chul
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

Restoring high-quality images from severely degraded inputs is essential for video coding for machines (VCM), where background regions are compressed at extremely low bitrates. In this letter, we propose a novel text-guided diffusion-based restoration (TGDR) algorithm, which integrates semantic information from text captions to guide the restoration process. Specifically, we develop a refinement block that incorporates a transformer-based time-aware feature extractor to fuse visual features, time-step embeddings, and textual semantics adaptively to guide a pretrained diffusion model during the reverse denoising process. By incorporating both visual and textual information, TGDR effectively reconstructs complex structures and improves semantic consistency in highly compressed regions. Experimental results show that TGDR achieves superior performance compared to state-of-the-art algorithms. © 2026 The Authors.

키워드

Diffusion modelImage generationImage restorationVideo coding for machines (VCM)
제목
Text-guided diffusion-based restoration of extremely compressed backgrounds for VCM
저자
Le Thi Hue DaoYang, NaeunLee, JooyoungJeong, SeyoonLee, Chul
DOI
10.1016/j.icte.2026.01.011
발행일
2026-04
유형
Article
저널명
ICT Express
12
2
페이지
487 ~ 492