PQCAD-DM: Progressive Quantization and Calibration-Assisted Distillation for Efficient Compression of Diffusion Models

Ko, Beomseok; Jang, Hyeryung

doi:10.1109/JIOT.2026.3695280

상세 보기

PQCAD-DM: Progressive Quantization and Calibration-Assisted Distillation for Efficient Compression of Diffusion Models

Ko, Beomseok;
Jang, Hyeryung

Citations

SCOPUS

0

초록

Diffusion models demonstrate outstanding performance in image generation, but their iterative sampling processes incur substantial computational costs and error accumulation, hindering deployment on resource-constrained IoT devices. In this paper, we propose PQCAD-DM, a hybrid compression framework that tightly integrates Progressive Quantization (PQ) and Calibration-Assisted Distillation (CAD). PQ employs a two-stage strategy with momentum-guided adaptive bit-width transitions to suppress instability, while CAD leverages a dual-calibration dataset to reconstruct robust full-precision guidance for low-bit students. Aligned with an industrial server-to-device deployment paradigm, PQCAD-DM optimizes models on high-resource servers for efficient edge inference. Extensive experiments, including validation on aerial and urban surveillance IoT datasets, demonstrate that PQCAD-DM consistently outperforms fixed-bit quantization baselines and architecture-optimized hybrid models. On a mobile smartphone, our framework achieves a 2.8× higher throughput and 48% reduction in model size while maintaining high visual fidelity, making it uniquely suited for latency-sensitive IoT applications. © 2014 IEEE.

키워드

Diffusion Models; Distillation; Model Compression; Quantization

제목: PQCAD-DM: Progressive Quantization and Calibration-Assisted Distillation for Efficient Compression of Diffusion Models

저자: Ko, Beomseok; Jang, Hyeryung

DOI: 10.1109/JIOT.2026.3695280

발행일: 2026

유형: Article in press

저널명: IEEE Internet of Things Journal