Impact of Word Embedding Methods on Performance of Sentiment Analysis with Machine Learning Techniques
- Authors
- 박호연; 김경재
- Issue Date
- Aug-2020
- Publisher
- 한국컴퓨터정보학회
- Keywords
- sentiment analysis; Bag of Words; TF-IDF; Word2Vec; machine learning
- Citation
- 한국컴퓨터정보학회논문지, v.25, no.8, pp 181 - 188
- Pages
- 8
- Indexed
- KCI
- Journal Title
- 한국컴퓨터정보학회논문지
- Volume
- 25
- Number
- 8
- Start Page
- 181
- End Page
- 188
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/6351
- DOI
- 10.9708/jksci.2020.25.08.181
- ISSN
- 1598-849X; 2383-9945
- Abstract
- This study compares how different word embedding techniques affect the performance of sentiment analysis. Sentiment analysis is an opinion mining technique that uses natural language processing to identify and extract subjective information from text, and it can be used to classify the sentiment of product reviews or comments. Because sentiment can be classified as either positive or negative, sentiment analysis can be treated as a general classification problem. Before analysis, the text must be converted into a form that a computer can process. In natural language processing, this conversion of text such as a word or document into a vector is called word embedding. Techniques used for word embedding include Bag of Words, TF-IDF, and Word2Vec. To date, few studies have examined which word embedding techniques are suitable for sentiment analysis. This study uses Bag of Words, TF-IDF, and Word2Vec to compare and analyze the performance of sentiment analysis on movie reviews. The data set is the IMDB data set, which is widely used in text mining. The results show that TF-IDF and Bag of Words both outperformed Word2Vec and that TF-IDF performed better than Bag of Words, but the difference was not very significant.
- Appears in Collections
- Dongguk Business School > Department of Management Information System > 1. Journal Articles
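
The abstract above describes converting text into vectors with Bag of Words and TF-IDF before classification. The following is a minimal sketch of those two representations, not the authors' implementation: the toy reviews, the whitespace tokenizer, the function names, and the smoothed IDF formula are all assumptions made for illustration. Word2Vec is omitted because it requires training a separate model.

```python
import math
from collections import Counter

# Toy reviews invented for illustration; the study itself uses the IMDB data set.
reviews = [
    "a great movie with a great cast",
    "a boring movie",
    "great fun",
]

def tokenize(text):
    # Assumption: lowercase and split on whitespace; real pipelines clean text further.
    return text.lower().split()

def build_vocabulary(docs):
    # Sorted list of every distinct token, so each term has a fixed vector position.
    return sorted({tok for doc in docs for tok in tokenize(doc)})

def bag_of_words(doc, vocab):
    # Bag of Words: one raw term count per vocabulary entry.
    counts = Counter(tokenize(doc))
    return [counts[term] for term in vocab]

def tf_idf(docs, vocab):
    # TF-IDF: term count scaled by an inverse document frequency weight.
    # Assumption: smoothed variant idf = ln((1 + N) / (1 + df)) + 1.
    n_docs = len(docs)
    bow = [bag_of_words(doc, vocab) for doc in docs]
    df = [sum(1 for row in bow if row[j] > 0) for j in range(len(vocab))]
    idf = [math.log((1 + n_docs) / (1 + d)) + 1 for d in df]
    return [[tf * w for tf, w in zip(row, idf)] for row in bow]

vocab = build_vocabulary(reviews)
bow_vectors = [bag_of_words(r, vocab) for r in reviews]
tfidf_vectors = tf_idf(reviews, vocab)

for review, bow_row, tfidf_row in zip(reviews, bow_vectors, tfidf_vectors):
    print(review)
    print("  BoW:   ", bow_row)
    print("  TF-IDF:", [round(x, 3) for x in tfidf_row])
```

In this sketch both methods produce vectors of the same length as the vocabulary. The difference is that TF-IDF gives more weight to terms that occur in few reviews (such as "boring") than to terms that occur in many (such as "a"). Either set of vectors could then be passed to a classifier of the reader's choice.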