Detailed Information

Real-time Informatized caption enhancement based on speaker pronunciation time database

Full metadata record
DC Field | Value | Language
dc.contributor.author | Choi, Yong-Sik | -
dc.contributor.author | Kang, Jin-Gu | -
dc.contributor.author | Joo, Jong Wha J. | -
dc.contributor.author | Jung, Jin-Woo | -
dc.date.accessioned | 2024-08-08T09:01:28Z | -
dc.date.available | 2024-08-08T09:01:28Z | -
dc.date.issued | 2020-12 | -
dc.identifier.issn | 1380-7501 | -
dc.identifier.issn | 1573-7721 | -
dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/20869 | -
dc.description.abstract | IBM Watson is a representative speech recognition tool that can automatically generate not only speech-to-text information but also speaker ID and timing information, together called an Informatized Caption. However, if the voice signal sent to the IBM Watson API contains noise, recognition performance decreases significantly. This is common in movies with background music and special sound effects. This paper aims to reduce the inaccuracy of current Informatized Captions in noisy environments. Based on the original caption and the Informatized Caption information, it proposes a method for correcting incorrectly recognized words and a method for improving timing accuracy while updating a database in real time. Experimental results show that the proposed method achieves 81.09% timing accuracy on 10 representative animation, horror, and action movies. | -
dc.format.extent | 22 | -
dc.language | English | -
dc.language.iso | ENG | -
dc.publisher | SPRINGER | -
dc.title | Real-time Informatized caption enhancement based on speaker pronunciation time database | -
dc.type | Article | -
dc.publisher.location | Netherlands | -
dc.identifier.doi | 10.1007/s11042-020-09590-2 | -
dc.identifier.scopusid | 2-s2.0-85090309398 | -
dc.identifier.wosid | 000566318900002 | -
dc.identifier.bibliographicCitation | MULTIMEDIA TOOLS AND APPLICATIONS, v.79, no.47-48, pp. 35667-35688 | -
dc.citation.title | MULTIMEDIA TOOLS AND APPLICATIONS | -
dc.citation.volume | 79 | -
dc.citation.number | 47-48 | -
dc.citation.startPage | 35667 | -
dc.citation.endPage | 35688 | -
dc.type.docType | Article | -
dc.description.isOpenAccess | Y | -
dc.description.journalRegisteredClass | scie | -
dc.description.journalRegisteredClass | scopus | -
dc.relation.journalResearchArea | Computer Science | -
dc.relation.journalResearchArea | Engineering | -
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | -
dc.relation.journalWebOfScienceCategory | Computer Science, Software Engineering | -
dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | -
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | -
dc.subject.keywordPlus | CENTRAL-LIMIT-THEOREM | -
dc.subject.keywordPlus | NETWORKS | -
dc.subject.keywordAuthor | Informatized caption | -
dc.subject.keywordAuthor | Speaker pronunciation time | -
dc.subject.keywordAuthor | IBM Watson API | -
dc.subject.keywordAuthor | Speech to text translation | -
Files in This Item
There are no files associated with this item.
Appears in Collections
College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Jung, Jin Woo
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)