Detailed Information

Cited 5 time in webofscience Cited 9 time in scopus
Metadata Downloads

Realistic Image Generation from Text by Using BERT-Based Embedding

Full metadata record
DC Field Value Language
dc.contributor.authorNa, Sanghyuck-
dc.contributor.authorDo, Mirae-
dc.contributor.authorYu, Kyeonah-
dc.contributor.authorKim, Juntae-
dc.date.accessioned2023-04-27T12:41:03Z-
dc.date.available2023-04-27T12:41:03Z-
dc.date.issued2022-03-
dc.identifier.issn2079-9292-
dc.identifier.issn2079-9292-
dc.identifier.urihttps://scholarworks.dongguk.edu/handle/sw.dongguk/3543-
dc.description.abstractRecently, in the field of artificial intelligence, multimodal learning has received a lot of attention due to expectations for the enhancement of AI performance and potential applications. Text-to-image generation, which is one of the multimodal tasks, is a challenging topic in computer vision and natural language processing. The text-to-image generation model based on generative adversarial network (GAN) utilizes a text encoder pre-trained with image-text pairs. However, text encoders pre-trained with image-text pairs cannot obtain rich information about texts not seen during pre-training, thus it is hard to generate an image that semantically matches a given text description. In this paper, we propose a new text-to-image generation model using pre-trained BERT, which is widely used in the field of natural language processing. The pre-trained BERT is used as a text encoder by performing fine-tuning with a large amount of text, so that rich information about the text is obtained and thus suitable for the image generation task. Through experiments using a multimodal benchmark dataset, we show that the proposed method improves the performance over the baseline model both quantitatively and qualitatively.-
dc.format.extent11-
dc.language영어-
dc.language.isoENG-
dc.publisherMDPI-
dc.titleRealistic Image Generation from Text by Using BERT-Based Embedding-
dc.typeArticle-
dc.publisher.location스위스-
dc.identifier.doi10.3390/electronics11050764-
dc.identifier.scopusid2-s2.0-85125419852-
dc.identifier.wosid000771163500001-
dc.identifier.bibliographicCitationElectronics, v.11, no.5, pp 1 - 11-
dc.citation.titleElectronics-
dc.citation.volume11-
dc.citation.number5-
dc.citation.startPage1-
dc.citation.endPage11-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaPhysics-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryPhysics, Applied-
dc.subject.keywordAuthortext to image generation-
dc.subject.keywordAuthormultimodal data-
dc.subject.keywordAuthorBERT-
dc.subject.keywordAuthorGAN-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Jun Tae photo

Kim, Jun Tae
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE