A Method to Real-Time Update Speaker Pronunciation Time-Database for the Application of Informatized Caption Enhancement by IBM Watson API

Choi, Y.-S.; Kim, I.-H.; Yang, H.-M.; Lim, D.-W.; Lin, A.; Jung, J.-W.

Full metadata record

DC Field	Value	Language
dc.contributor.author	Choi, Y.-S.	-
dc.contributor.author	Kim, I.-H.	-
dc.contributor.author	Yang, H.-M.	-
dc.contributor.author	Lim, D.-W.	-
dc.contributor.author	Lin, A.	-
dc.contributor.author	Jung, J.-W.	-
dc.date.accessioned	2023-04-28T05:41:59Z	-
dc.date.available	2023-04-28T05:41:59Z	-
dc.date.issued	2019	-
dc.identifier.issn	1876-1100	-
dc.identifier.issn	1876-1119	-
dc.identifier.uri	https://scholarworks.dongguk.edu/handle/sw.dongguk/8551	-
dc.description.abstract	One of the major fields of AI research is natural language processing based on speech recognition. IBM Watson is a representative speech recognition tool: it automatically generates not only the words recognized from a voice signal but also the speaker ID and the timing information of each word, including its start and end times. However, IBM Watson is not sufficiently accurate and easily produces incorrect recognition output when the audio signal contains noise, especially in movies, where background music and special sound effects are mixed in. Some studies have addressed this problem using the IBM Watson API, on the assumption that a speaker pronunciation time DB had already been properly implemented. However, building a speaker pronunciation time DB is not easy and is costly. In this paper, to resolve this problem, we introduce an efficient method for implementing and updating the speaker pronunciation time DB in real time. © 2019, Springer Nature Singapore Pte Ltd.	-
dc.format.extent	6	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	Springer Verlag	-
dc.title	A Method to Real-Time Update Speaker Pronunciation Time-Database for the Application of Informatized Caption Enhancement by IBM Watson API	-
dc.type	Article	-
dc.publisher.location	Germany	-
dc.identifier.doi	10.1007/978-981-13-3648-5_56	-
dc.identifier.scopusid	2-s2.0-85066022425	-
dc.identifier.bibliographicCitation	Lecture Notes in Electrical Engineering, v.542, pp 490 - 495	-
dc.citation.title	Lecture Notes in Electrical Engineering	-
dc.citation.volume	542	-
dc.citation.startPage	490	-
dc.citation.endPage	495	-
dc.type.docType	Conference Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.subject.keywordAuthor	IBM Watson API	-
dc.subject.keywordAuthor	Informatized caption	-
dc.subject.keywordAuthor	Speaker pronunciation time-DB	-
dc.subject.keywordAuthor	Speech recognition	-
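
The abstract describes a speaker pronunciation time DB that is built and updated in real time from recognized words, speaker IDs, and word start/end times. The sketch below is only an illustration of that idea, not the authors' method, which this record does not detail. It assumes input shaped like the `timestamps` (`[word, start, end]`) and `speaker_labels` (`from`, `to`, `speaker`) fields of a speech-to-text result; the class `PronunciationTimeDB` and the function `ingest` are hypothetical names, and the running-mean update is an assumed design choice.

```python
from collections import defaultdict


class PronunciationTimeDB:
    """Hypothetical store: per-speaker, per-word running mean of duration (seconds)."""

    def __init__(self):
        # speaker -> word -> (sample count, mean duration)
        self._stats = defaultdict(dict)

    def update(self, speaker, word, start, end):
        """Fold one observed pronunciation into the running mean."""
        duration = end - start
        if duration <= 0:
            return  # ignore malformed timings
        count, mean = self._stats[speaker].get(word, (0, 0.0))
        count += 1
        mean += (duration - mean) / count  # incremental mean, no history kept
        self._stats[speaker][word] = (count, mean)

    def mean_duration(self, speaker, word):
        """Return the mean duration, or None if the pair was never seen."""
        entry = self._stats.get(speaker, {}).get(word)
        return entry[1] if entry else None


def ingest(db, timestamps, speaker_labels):
    """Update the DB from one recognition result as it arrives.

    Assumes each speaker label carries the same (from, to) interval as the
    word timestamp it belongs to; words without a matching label are skipped.
    """
    speaker_by_interval = {
        (label["from"], label["to"]): label["speaker"] for label in speaker_labels
    }
    for word, start, end in timestamps:
        speaker = speaker_by_interval.get((start, end))
        if speaker is not None:
            db.update(speaker, word.lower(), start, end)


if __name__ == "__main__":
    db = PronunciationTimeDB()
    ingest(
        db,
        [["hello", 0.0, 0.5], ["world", 0.5, 1.0]],
        [{"from": 0.0, "to": 0.5, "speaker": 0},
         {"from": 0.5, "to": 1.0, "speaker": 1}],
    )
    print(db.mean_duration(0, "hello"))
```

Because each update touches a single (speaker, word) entry and stores only a count and a mean, the table can be refreshed per recognition result without reprocessing earlier audio.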

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

Related Researcher

Jung, Jin Woo: College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
