Detailed Information


Impact of Word Embedding Methods on Performance of Sentiment Analysis with Machine Learning Techniques

Authors
박호연; 김경재
Issue Date
Aug-2020
Publisher
한국컴퓨터정보학회 (Korea Society of Computer and Information)
Keywords
sentiment analysis; Bag of Words; TF-IDF; Word2Vec; machine learning
Citation
한국컴퓨터정보학회논문지 (Journal of the Korea Society of Computer and Information), v.25, no.8, pp. 181-188
Pages
8
Indexed
KCI
Journal Title
한국컴퓨터정보학회논문지 (Journal of the Korea Society of Computer and Information)
Volume
25
Number
8
Start Page
181
End Page
188
URI
https://scholarworks.dongguk.edu/handle/sw.dongguk/6351
DOI
10.9708/jksci.2020.25.08.181
ISSN
1598-849X
2383-9945
Abstract
This study compares word embedding techniques to assess their impact on sentiment analysis performance. Sentiment analysis is an opinion mining technique that uses natural language processing to identify and extract subjective information from text, and it can be used to classify the sentiment of product reviews or comments. Because sentiment can be labeled as either positive or negative, the task can be treated as a general classification problem. Before analysis, the text must be converted into a form that a computer can process, so a word or document is transformed into a vector; in natural language processing this step is called word embedding. Common techniques include Bag of Words, TF-IDF, and Word2Vec. To date, few studies have examined which word embedding techniques are suitable for sentiment analysis. This study uses Bag of Words, TF-IDF, and Word2Vec to compare and analyze performance on movie review sentiment analysis. The data set is the IMDB data set, which is widely used in text mining. The results show that TF-IDF and Bag of Words both outperformed Word2Vec, and that TF-IDF performed better than Bag of Words, although the difference between those two was not very significant.
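To make the text-to-vector step in the abstract concrete, the following is a minimal sketch of Bag of Words and TF-IDF vectors in plain Python. It is not the authors' code: the two toy reviews, the whitespace tokenizer, the helper names, and the unsmoothed weighting idf = ln(N / df) are all assumptions chosen for illustration, and the record does not say which TF-IDF variant or tooling the study used. Word2Vec is left out because it requires a trained model.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def build_vocabulary(docs):
    """Return the sorted list of distinct tokens in the corpus."""
    return sorted({tok for doc in docs for tok in tokenize(doc)})


def bag_of_words(doc, vocab):
    """Raw term counts, one entry per vocabulary word."""
    counts = Counter(tokenize(doc))
    return [counts[word] for word in vocab]


def tf_idf(docs, vocab):
    """TF-IDF with tf = raw count and idf = ln(N / df) (assumed variant)."""
    n_docs = len(docs)
    token_sets = [set(tokenize(doc)) for doc in docs]
    # Document frequency: number of documents containing each word.
    df = {w: sum(1 for toks in token_sets if w in toks) for w in vocab}
    idf = {w: math.log(n_docs / df[w]) for w in vocab}
    return [
        [count * idf[w] for count, w in zip(bag_of_words(doc, vocab), vocab)]
        for doc in docs
    ]


# Hypothetical toy corpus, not taken from the IMDB data set.
reviews = [
    "a great movie with a great cast",
    "a boring movie",
]

vocab = build_vocabulary(reviews)
bow = [bag_of_words(r, vocab) for r in reviews]
tfidf = tf_idf(reviews, vocab)

print(vocab)
print(bow)
print(tfidf)
```

In this sketch, words that occur in every review ("a", "movie") receive a TF-IDF weight of zero, while Bag of Words keeps their raw counts. Either vector could then be passed to a classifier.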
Files in This Item
There are no files associated with this item.
Appears in
Collections
Dongguk Business School > Department of Management Information System > 1. Journal Articles


Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher


Kim, Kyong Jae
Dongguk Business School (Department of Management Information System)
