EmoBERTa-CNN: Hybrid Deep Learning Approach Capturing Global Semantics and Local Features for Enhanced Emotion Recognition in Conversational Settingsopen access
- Authors
- Zhang, Mingfeng; Yu, Aihe; Sheng, Xuanyu; Park, Jisun; Rhee, Jongtae; Cho, Kyungeun
- Issue Date
- Jul-2025
- Publisher
- MDPI
- Keywords
- pre-trained language model; deep learning; emotion recognition
- Citation
- Mathematics, v.13, no.15, pp 1 - 20
- Pages
- 20
- Indexed
- SCIE
SCOPUS
- Journal Title
- Mathematics
- Volume
- 13
- Number
- 15
- Start Page
- 1
- End Page
- 20
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/58997
- DOI
- 10.3390/math13152438
- ISSN
- 2227-7390
2227-7390
- Abstract
- Emotion recognition in conversations is a key task in natural language processing that enhances the quality of human-computer interactions. Although existing deep learning and Transformer-based pretrained language models have shown remarkably enhanced performances, both approaches have inherent limitations. Deep learning models often fail to capture the global semantic context, whereas Transformer-based pretrained language models can overlook subtle, local emotional cues. To overcome these challenges, we developed EmoBERTa-CNN, a hybrid framework that combines EmoBERTa's ability to capture global semantics with the capability of convolutional neural networks (CNNs) to extract local emotional features. Experiments on the SemEval-2019 Task 3 and Multimodal EmotionLines Dataset (MELD) demonstrated that the proposed EmoBERTa-CNN model achieved F1-scores of 96.0% and 79.45%, respectively, significantly outperforming existing methods and confirming its effectiveness for emotion recognition in conversations.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Engineering > Department of Industrial and Systems Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.