AI 목소리와 OTT 콘텐츠의 사운드 미학
AI Voice and Sound Aesthetics in OTT Content
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

In cinema, the voice has long served as a crucial auditory resource for constructing emotion, identity, and narrative flow. Recent advancements in artificial intelligence (AI)-based voice synthesis technologies have initiated a significant technical shift in how voices are produced, particularly within the Over-the-Top (OTT) content environment, which demands multilingual delivery and rapid production cycles. This study examines the application of AI voice technologies—such as Text-to-Speech(TTS) and voice cloning—in film and OTT-based audiovisual content through practical production contexts and real-world case studies. By comparing major commercial platforms such as ElevenLabs, Typecast, and CLOVA Dubbing, this paper analyzes their technical architectures, emotional control capabilities, and options for custom voice configuration. Drawing on the author’s direct involvement in the sound design of the film, the study further investigates how AI voices are integrated into actual production workflows. Special attention is given to features such as emotional modulation and prosody control, which can enhance narrative immersion, while also acknowledging persistent limitations, including lip-sync precision and expressive nuance. This research aims to provide a balanced assessment of both the potential and constraints of AI-generated voices as emerging cinematic resources, grounded in a practical and production-oriented perspective.

키워드

AI voiceOTTvoice cloningTTScinema sound
제목
AI 목소리와 OTT 콘텐츠의 사운드 미학
제목 (타언어)
AI Voice and Sound Aesthetics in OTT Content
저자
최지현이원덕
DOI
10.34269/mitak.2025.1.48.002
발행일
2025-09
유형
Y
저널명
영상기술연구
1
48
페이지
27 ~ 51