상세 보기
- 최지현;
- 이원덕
WEB OF SCIENCE
0SCOPUS
0초록
In cinema, the voice has long served as a crucial auditory resource for constructing emotion, identity, and narrative flow. Recent advancements in artificial intelligence (AI)-based voice synthesis technologies have initiated a significant technical shift in how voices are produced, particularly within the Over-the-Top (OTT) content environment, which demands multilingual delivery and rapid production cycles. This study examines the application of AI voice technologies—such as Text-to-Speech(TTS) and voice cloning—in film and OTT-based audiovisual content through practical production contexts and real-world case studies. By comparing major commercial platforms such as ElevenLabs, Typecast, and CLOVA Dubbing, this paper analyzes their technical architectures, emotional control capabilities, and options for custom voice configuration. Drawing on the author’s direct involvement in the sound design of the film, the study further investigates how AI voices are integrated into actual production workflows. Special attention is given to features such as emotional modulation and prosody control, which can enhance narrative immersion, while also acknowledging persistent limitations, including lip-sync precision and expressive nuance. This research aims to provide a balanced assessment of both the potential and constraints of AI-generated voices as emerging cinematic resources, grounded in a practical and production-oriented perspective.
키워드
- 제목
- AI 목소리와 OTT 콘텐츠의 사운드 미학
- 제목 (타언어)
- AI Voice and Sound Aesthetics in OTT Content
- 저자
- 최지현; 이원덕
- 발행일
- 2025-09
- 유형
- Y
- 저널명
- 영상기술연구
- 권
- 1
- 호
- 48
- 페이지
- 27 ~ 51