Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

A Survey of Generative Models for Image and Video with Diffusion Model

Authors
Koh, Byoung SooPark, Hyeong CheolPark, Jin Ho
Issue Date
Dec-2024
Publisher
한국컴퓨터산업협회
Keywords
Deep Learning; Generative Model; Diffusion Model; Multimodal Learning
Citation
Human-centric Computing and Information Sciences, v.14, pp 1 - 20
Pages
20
Indexed
SCIE
SCOPUS
KCI
Journal Title
Human-centric Computing and Information Sciences
Volume
14
Start Page
1
End Page
20
URI
https://scholarworks.dongguk.edu/handle/sw.dongguk/56328
DOI
10.22967/HCIS.2024.14.069
ISSN
2192-1962
2192-1962
Abstract
With recent advances in deep learning-based generative models, it is now possible to synthesize realistic data in a diverse domain. One notable method in the generative model is a diffusion-based generative model that generates realistic and high-quality images and videos. Diffusion-based generative model leverages a diffusion process to transform a Gaussian noise distribution into a complex, realistic data distribution. To illustrate the diffusion-based generative models, we give an overview diffusion probabilistic models and denoising diffusion probabilistic models. Especially, we review research that presents new methodologies for image, video, and multimedia contents generation, aiming to understand how those models efficiently learn complex data distribution using various techniques. In the meantime, using multimodal data for training generative models helps them learn more about various representations of complex data distribution, which enhances the generation of diverse images and videos. For the main contribution of this paper, we present several effective methods for synthesizing various types of data using diffusion models and multimodal data, along with their applications. In this context, we believe that presenting how diffusion models have expanded into multimedia generation along with the progression of technological advancements will provide knowledge and inspiration to many researchers.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Park, Jin Ho photo

Park, Jin Ho
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE