A Survey of Generative Models for Image and Video with Diffusion Model
- Authors
- Koh, Byoung Soo; Park, Hyeong Cheol; Park, Jin Ho
- Issue Date
- Dec-2024
- Publisher
- 한국컴퓨터산업협회
- Keywords
- Deep Learning; Generative Model; Diffusion Model; Multimodal Learning
- Citation
- Human-centric Computing and Information Sciences, v.14, pp 1 - 20
- Pages
- 20
- Indexed
- SCIE
SCOPUS
KCI
- Journal Title
- Human-centric Computing and Information Sciences
- Volume
- 14
- Start Page
- 1
- End Page
- 20
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/56328
- DOI
- 10.22967/HCIS.2024.14.069
- ISSN
- 2192-1962
2192-1962
- Abstract
- With recent advances in deep learning-based generative models, it is now possible to synthesize realistic data in a diverse domain. One notable method in the generative model is a diffusion-based generative model that generates realistic and high-quality images and videos. Diffusion-based generative model leverages a diffusion process to transform a Gaussian noise distribution into a complex, realistic data distribution. To illustrate the diffusion-based generative models, we give an overview diffusion probabilistic models and denoising diffusion probabilistic models. Especially, we review research that presents new methodologies for image, video, and multimedia contents generation, aiming to understand how those models efficiently learn complex data distribution using various techniques. In the meantime, using multimodal data for training generative models helps them learn more about various representations of complex data distribution, which enhances the generation of diverse images and videos. For the main contribution of this paper, we present several effective methods for synthesizing various types of data using diffusion models and multimodal data, along with their applications. In this context, we believe that presenting how diffusion models have expanded into multimedia generation along with the progression of technological advancements will provide knowledge and inspiration to many researchers.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.