Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Data Augmentation Techniques Using Text-to-Image Diffusion Models for Enhanced Data Diversity

Full metadata record
DC Field Value Language
dc.contributor.authorShin, Jeongmin-
dc.contributor.authorJang, Hyeryung-
dc.date.accessioned2025-03-12T03:00:11Z-
dc.date.available2025-03-12T03:00:11Z-
dc.date.issued2024-
dc.identifier.issn2162-1233-
dc.identifier.issn2162-1241-
dc.identifier.urihttps://scholarworks.dongguk.edu/handle/sw.dongguk/57910-
dc.description.abstractData augmentation is a widely used technique to enhance the performance of deep learning models. However, traditional augmentation methods, dependent solely on original data, often fall short in maintaining data diversity and generalization capabilities. In this paper, we propose a novel data augmentation approach leveraging pretrained text-to-image diffusion models to generate diverse and contextually rich images. Our approach integrates three advanced techniques: rich-text prompts, multi-object image generation, and inpainting. We demonstrate the effectiveness of these methods through extensive experiments on the Oxford-IIIT Pets and Caltech-101 datasets, where our diffusion-based augmentations significantly improved downstream classification accuracy and model generalization. No-tably, the inpainting technique excels in handling class imbalances by balancing the diversity and structural integrity of original data, while rich-text prompts and multi-object generation offer substantial gains by enhancing diversity and realism. Additionally, our methods show enhanced generalization to unseen data, proving their robustness and applicability to various deep learning tasks. © 2024 IEEE.-
dc.format.extent6-
dc.language영어-
dc.language.isoENG-
dc.publisherIEEE-
dc.titleData Augmentation Techniques Using Text-to-Image Diffusion Models for Enhanced Data Diversity-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/ICTC62082.2024.10827311-
dc.identifier.scopusid2-s2.0-85217671096-
dc.identifier.bibliographicCitation2024 15th International Conference on Information and Communication Technology Convergence (ICTC), pp 2027 - 2032-
dc.citation.title2024 15th International Conference on Information and Communication Technology Convergence (ICTC)-
dc.citation.startPage2027-
dc.citation.endPage2032-
dc.type.docTypeConference paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.subject.keywordAuthorAdversarial Machine Learning-
dc.subject.keywordAuthorSpatio-temporal Data-
dc.subject.keywordAuthorAugmentation Methods-
dc.subject.keywordAuthorAugmentation Techniques-
dc.subject.keywordAuthorData Augmentation-
dc.subject.keywordAuthorDiffusion Model-
dc.subject.keywordAuthorGeneralization Capability-
dc.subject.keywordAuthorImage Diffusion-
dc.subject.keywordAuthorLearning Models-
dc.subject.keywordAuthorMultiobject-
dc.subject.keywordAuthorPerformance-
dc.subject.keywordAuthorRich Texts-
dc.subject.keywordAuthorContrastive Learning-
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Jang, Hye Ryung photo

Jang, Hye Ryung
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE