Text-guided diffusion-based restoration of extremely compressed backgrounds for VCM

Le Thi Hue Dao; Yang, Naeun; Lee, Jooyoung; Jeong, Seyoon; Lee, Chul

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Text-guided diffusion-based restoration of extremely compressed backgrounds for VCMopen access

Authors: Le Thi Hue Dao; Yang, Naeun; Lee, Jooyoung; Jeong, Seyoon; Lee, Chul

Issue Date: Apr-2026

Publisher: 한국통신학회

Keywords: Diffusion model; Image generation; Image restoration; Video coding for machines (VCM)

Citation: ICT Express, v.12, no.2, pp 487 - 492

Pages: 6

Indexed: SCIE
SCOPUS
KCI

Journal Title: ICT Express

Volume: 12

Number: 2

Start Page: 487

End Page: 492

URI: https://scholarworks.dongguk.edu/handle/sw.dongguk/63737

DOI: 10.1016/j.icte.2026.01.011

ISSN: 2405-9595
2405-9595

Abstract: Restoring high-quality images from severely degraded inputs is essential for video coding for machines (VCM), where background regions are compressed at extremely low bitrates. In this letter, we propose a novel text-guided diffusion-based restoration (TGDR) algorithm, which integrates semantic information from text captions to guide the restoration process. Specifically, we develop a refinement block that incorporates a transformer-based time-aware feature extractor to fuse visual features, time-step embeddings, and textual semantics adaptively to guide a pretrained diffusion model during the reverse denoising process. By incorporating both visual and textual information, TGDR effectively reconstructs complex structures and improves semantic consistency in highly compressed regions. Experimental results show that TGDR achieves superior performance compared to state-of-the-art algorithms. © 2026 The Authors.

Files in This Item: There are no files associated with this item.

Appears in Collections: ETC > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Lee, Chul photo

Lee, Chul: College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE