Data preprocessing and transformation in the sentiment analysis using a deep learning technique
- Authors
- 서혜진; 신정아
- Issue Date
- Mar-2020
- Publisher
- 한국영어학회
- Keywords
- Data preprocessing; Deep learning; Sentiment analysis; Transformation
- Citation
- 영어학, v.20, no.20, pp 42 - 63
- Pages
- 22
- Indexed
- SCOPUS
KCI
- Journal Title
- 영어학
- Volume
- 20
- Number
- 20
- Start Page
- 42
- End Page
- 63
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/7098
- DOI
- 10.15738/kjell.20..202003.42
- ISSN
- 1598-1398
2586-7474
- Abstract
- This study examined how to preprocess and transform data efficiently in order to use deep learning techniques in analyzing linguistic data. Researchers interests in deep learning techniques have explosively increased worldwide; however, it is not easy for them to link linguistics to deep learning techniques or algorithms because linguists do not know how and where to begin in using them. Thus, this study provides the general procedure to train data using deep learning algorithms in practice. In particular, for instance, we focused on how to preprocess and transform Tweet data for a sentiment analysis by using deep learning techniques. In addition, we introduced the latest deep learning algorithm, so-called BERT, in the data preprocessing and transformation procedure. The data preprocessing is particularly important because the result from deep learning can significantly vary depending on it. Even though the data preprocessing procedure can differ according to the aim of research, this study tries to introduce the general way that advanced researchers frequently use for deep learning algorithms. This study is expected to lower the barriers in applying deep learning techniques to linguistic data and make it easier for researchers to conduct deep learning research related to linguistics. © 2020, Korean Society for the Study of English Language and Linguistics. All rights reserved.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Humanities > Division of English Language & Literature > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.