Cited 1 time in
Development of an Embedding Framework for Clustering Scientific Papers
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Kim, Songhee | - |
| dc.contributor.author | Lee, Suyeong | - |
| dc.contributor.author | Yoon, Byungun | - |
| dc.date.accessioned | 2023-04-27T14:40:24Z | - |
| dc.date.available | 2023-04-27T14:40:24Z | - |
| dc.date.issued | 2022 | - |
| dc.identifier.issn | 2169-3536 | - |
| dc.identifier.issn | 2169-3536 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/3912 | - |
| dc.description.abstract | In this era, research and development are becoming a continuous and accelerating process because technology changes rapidly with a short lifecycle. As a result, various methodologies are being developed to monitor these rapidly changing research trends; In particular, clustering method-related studies in science and technology documents are being developed with a variety of approaches. However, previous studies on document clustering methods focus on a specific field or language but do not take into consideration certain important pieces of information in science and technology documents. Therefore, this study proposes an embedding methodology that uses important content from scientific and technical documents. We took into consideration the importance of information containing core structures in science and technology documents and proposed a clustering methodology that analyzes structured and unstructured data, such as textual information, author information, and citation information. The proposed method combines both textual and structural data from the paper, using a method that focuses on screening important information by sections in science and technology documents. Then, Girvan-Newman clustering and Louvain clustering models are applied to generate embedding vectors and show evaluation results through the clustering indices. As a practical example, we applied the proposed methodology using paper data from the field of hydrogen cell vehicles. The results of this study will be effective in identifying gaps in technology for new technological development, identifying technology trends, and presenting directional information for future technology development. | - |
| dc.format.extent | 14 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | IEEE | - |
| dc.title | Development of an Embedding Framework for Clustering Scientific Papers | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1109/ACCESS.2022.3160826 | - |
| dc.identifier.scopusid | 2-s2.0-85127060606 | - |
| dc.identifier.wosid | 000776245200001 | - |
| dc.identifier.bibliographicCitation | IEEE Access, v.10, pp 32608 - 32621 | - |
| dc.citation.title | IEEE Access | - |
| dc.citation.volume | 10 | - |
| dc.citation.startPage | 32608 | - |
| dc.citation.endPage | 32621 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Telecommunications | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.relation.journalWebOfScienceCategory | Telecommunications | - |
| dc.subject.keywordPlus | EXTRACTION | - |
| dc.subject.keywordAuthor | Data mining | - |
| dc.subject.keywordAuthor | Patents | - |
| dc.subject.keywordAuthor | Market research | - |
| dc.subject.keywordAuthor | Clustering methods | - |
| dc.subject.keywordAuthor | Codes | - |
| dc.subject.keywordAuthor | Research and development | - |
| dc.subject.keywordAuthor | Metadata | - |
| dc.subject.keywordAuthor | Clustering method | - |
| dc.subject.keywordAuthor | data mining | - |
| dc.subject.keywordAuthor | text mining | - |
| dc.subject.keywordAuthor | text analysis | - |
| dc.subject.keywordAuthor | scientific publishing | - |
| dc.subject.keywordAuthor | fuel cells | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
