Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

A Method to Real-Time Update Speaker Pronunciation Time-Database for the Application of Informatized Caption Enhancement by IBM Watson API

Authors
Choi, Y.-S.Kim, I.-H.Yang, H.-M.Lim, D.-W.Lin, A.Jung, J.-W.
Issue Date
2019
Publisher
Springer Verlag
Keywords
IBM Watson API; Informatized caption; Speaker pronunciation time-DB; Speech recognition
Citation
Lecture Notes in Electrical Engineering, v.542, pp 490 - 495
Pages
6
Indexed
SCOPUS
Journal Title
Lecture Notes in Electrical Engineering
Volume
542
Start Page
490
End Page
495
URI
https://scholarworks.dongguk.edu/handle/sw.dongguk/8551
DOI
10.1007/978-981-13-3648-5_56
ISSN
1876-1100
1876-1119
Abstract
One of the major AI research fields is natural language processing by speech recognition. IBM Watson is one of the representative tools for this speech recognition system which can automatically generate not only the recognized words from voice signal but also the speaker ID and timing information of each words including the starting time and the ending time. However, IBM Watson is not enough good and easily generate incorrect recognition output when there are some noise in the audio signal, especially for movies where background music and special sound effects are incorporated together. There were some studies to solve this problem using the IBM Watson API based on the assumption that speaker pronunciation time DB was already implemented properly. But, it is not easy to make speaker pronunciation time DB and it requires big cost. In this paper, to resolve this problem of speaker pronunciation time DB, we introduce an efficient method to implement and update the speaker pronunciation time DB in real time. © 2019, Springer Nature Singapore Pte Ltd.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Jung, Jin Woo photo

Jung, Jin Woo
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE