PQCAD-DM: Progressive Quantization and Calibration-Assisted Distillation for Efficient Compression of Diffusion Models

Citations

SCOPUS

0

초록

Diffusion models demonstrate outstanding performance in image generation, but their iterative sampling processes incur substantial computational costs and error accumulation, hindering deployment on resource-constrained IoT devices. In this paper, we propose PQCAD-DM, a hybrid compression framework that tightly integrates Progressive Quantization (PQ) and Calibration-Assisted Distillation (CAD). PQ employs a two-stage strategy with momentum-guided adaptive bit-width transitions to suppress instability, while CAD leverages a dual-calibration dataset to reconstruct robust full-precision guidance for low-bit students. Aligned with an industrial server-to-device deployment paradigm, PQCAD-DM optimizes models on high-resource servers for efficient edge inference. Extensive experiments, including validation on aerial and urban surveillance IoT datasets, demonstrate that PQCAD-DM consistently outperforms fixed-bit quantization baselines and architecture-optimized hybrid models. On a mobile smartphone, our framework achieves a 2.8× higher throughput and 48% reduction in model size while maintaining high visual fidelity, making it uniquely suited for latency-sensitive IoT applications. © 2014 IEEE.

키워드

Diffusion ModelsDistillationModel CompressionQuantization
제목
PQCAD-DM: Progressive Quantization and Calibration-Assisted Distillation for Efficient Compression of Diffusion Models
저자
Ko, BeomseokJang, Hyeryung
DOI
10.1109/JIOT.2026.3695280
발행일
2026
유형
Article in press
저널명
IEEE Internet of Things Journal