Text-guided diffusion-based restoration of extremely compressed backgrounds for VCM

Le Thi Hue Dao; Yang, Naeun; Lee, Jooyoung; Jeong, Seyoon; Lee, Chul

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Text-guided diffusion-based restoration of extremely compressed backgrounds for VCM

Full metadata record

DC Field	Value	Language
dc.contributor.author	Le Thi Hue Dao	-
dc.contributor.author	Yang, Naeun	-
dc.contributor.author	Lee, Jooyoung	-
dc.contributor.author	Jeong, Seyoon	-
dc.contributor.author	Lee, Chul	-
dc.date.accessioned	2026-02-19T06:00:20Z	-
dc.date.available	2026-02-19T06:00:20Z	-
dc.date.issued	2026-04	-
dc.identifier.issn	2405-9595	-
dc.identifier.issn	2405-9595	-
dc.identifier.uri	https://scholarworks.dongguk.edu/handle/sw.dongguk/63737	-
dc.description.abstract	Restoring high-quality images from severely degraded inputs is essential for video coding for machines (VCM), where background regions are compressed at extremely low bitrates. In this letter, we propose a novel text-guided diffusion-based restoration (TGDR) algorithm, which integrates semantic information from text captions to guide the restoration process. Specifically, we develop a refinement block that incorporates a transformer-based time-aware feature extractor to fuse visual features, time-step embeddings, and textual semantics adaptively to guide a pretrained diffusion model during the reverse denoising process. By incorporating both visual and textual information, TGDR effectively reconstructs complex structures and improves semantic consistency in highly compressed regions. Experimental results show that TGDR achieves superior performance compared to state-of-the-art algorithms. © 2026 The Authors.	-
dc.format.extent	6	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	한국통신학회	-
dc.title	Text-guided diffusion-based restoration of extremely compressed backgrounds for VCM	-
dc.type	Article	-
dc.publisher.location	대한민국	-
dc.identifier.doi	10.1016/j.icte.2026.01.011	-
dc.identifier.scopusid	2-s2.0-105029086915	-
dc.identifier.bibliographicCitation	ICT Express, v.12, no.2, pp 487 - 492	-
dc.citation.title	ICT Express	-
dc.citation.volume	12	-
dc.citation.number	2	-
dc.citation.startPage	487	-
dc.citation.endPage	492	-
dc.type.docType	Article in press	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.description.journalRegisteredClass	kci	-
dc.subject.keywordAuthor	Diffusion model	-
dc.subject.keywordAuthor	Image generation	-
dc.subject.keywordAuthor	Image restoration	-
dc.subject.keywordAuthor	Video coding for machines (VCM)	-

Files in This Item: There are no files associated with this item.

Appears in Collections: ETC > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Lee, Chul photo

Lee, Chul: College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE