Development of an Embedding Framework for Clustering Scientific Papersopen access
- Authors
- Kim, Songhee; Lee, Suyeong; Yoon, Byungun
- Issue Date
- 2022
- Publisher
- IEEE
- Keywords
- Data mining; Patents; Market research; Clustering methods; Codes; Research and development; Metadata; Clustering method; data mining; text mining; text analysis; scientific publishing; fuel cells
- Citation
- IEEE Access, v.10, pp 32608 - 32621
- Pages
- 14
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE Access
- Volume
- 10
- Start Page
- 32608
- End Page
- 32621
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/3912
- DOI
- 10.1109/ACCESS.2022.3160826
- ISSN
- 2169-3536
2169-3536
- Abstract
- In this era, research and development are becoming a continuous and accelerating process because technology changes rapidly with a short lifecycle. As a result, various methodologies are being developed to monitor these rapidly changing research trends; In particular, clustering method-related studies in science and technology documents are being developed with a variety of approaches. However, previous studies on document clustering methods focus on a specific field or language but do not take into consideration certain important pieces of information in science and technology documents. Therefore, this study proposes an embedding methodology that uses important content from scientific and technical documents. We took into consideration the importance of information containing core structures in science and technology documents and proposed a clustering methodology that analyzes structured and unstructured data, such as textual information, author information, and citation information. The proposed method combines both textual and structural data from the paper, using a method that focuses on screening important information by sections in science and technology documents. Then, Girvan-Newman clustering and Louvain clustering models are applied to generate embedding vectors and show evaluation results through the clustering indices. As a practical example, we applied the proposed methodology using paper data from the field of hydrogen cell vehicles. The results of this study will be effective in identifying gaps in technology for new technological development, identifying technology trends, and presenting directional information for future technology development.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Engineering > Department of Industrial and Systems Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.