Cited 33 time in
TechWord: Development of a technology lexical database for structuring textual technology information based on natural language processing
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Jang, Hyejin | - |
| dc.contributor.author | Jeong, Yujin | - |
| dc.contributor.author | Yoon, Byungun | - |
| dc.date.accessioned | 2024-08-08T07:30:46Z | - |
| dc.date.available | 2024-08-08T07:30:46Z | - |
| dc.date.issued | 2021-02 | - |
| dc.identifier.issn | 0957-4174 | - |
| dc.identifier.issn | 1873-6793 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/19530 | - |
| dc.description.abstract | The role of text mining based on technological documents such as patents is important in the research field of technology intelligence for technology R&D planning. In addition, WordNet, an English-based lexical database, is widely used for pre-processing text data such as word lemmatization and synonym search. However, technological vocabulary information is complex and specific, and WordNet's ability to analyze technological information is limited in its reflecting technological features. Thus, to improve the text mining performance of technological information, this study proposes a methodology for designing a TechWord-based lexical database that is based on the lexical characteristics of technological words that are differentiated from general words. To do this, we define TechWord, a technology lexical information, and construct a TechSynset, a synonym set between TechWords. First, through dependency parsing between words, TechWord, a unit word that describes a technology, is structured and identifies nouns and verbs. The importance of connectivity is investigated by a network centrality index analysis based on the dependency relations of words. Subsequently, to search for synonyms suitable for the target technology domain, a TechSynset is constructed through synset information, with an additional analysis that calculates cosine similarity based on a word embedding vector. Applying the proposed methodology to the actual technology-related information analysis, we collect patent data on the technological fields of the automotive field, and present the results of the TechWord and TechSynset. This study improves technological information-based text mining by structuring the word-to-word link information in technological documents based on an automated process. | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | PERGAMON-ELSEVIER SCIENCE LTD | - |
| dc.title | TechWord: Development of a technology lexical database for structuring textual technology information based on natural language processing | - |
| dc.type | Article | - |
| dc.publisher.location | 영국 | - |
| dc.identifier.doi | 10.1016/j.eswa.2020.114042 | - |
| dc.identifier.scopusid | 2-s2.0-85091777411 | - |
| dc.identifier.wosid | 000599704400007 | - |
| dc.identifier.bibliographicCitation | EXPERT SYSTEMS WITH APPLICATIONS, v.164 | - |
| dc.citation.title | EXPERT SYSTEMS WITH APPLICATIONS | - |
| dc.citation.volume | 164 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Operations Research & Management Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.relation.journalWebOfScienceCategory | Operations Research & Management Science | - |
| dc.subject.keywordPlus | EMERGING TECHNOLOGIES | - |
| dc.subject.keywordPlus | INTELLIGENCE SYSTEM | - |
| dc.subject.keywordPlus | PATENT ANALYSIS | - |
| dc.subject.keywordPlus | WORDNET | - |
| dc.subject.keywordPlus | TRENDS | - |
| dc.subject.keywordPlus | AREAS | - |
| dc.subject.keywordAuthor | Patent mining | - |
| dc.subject.keywordAuthor | Natural language processing | - |
| dc.subject.keywordAuthor | Text mining | - |
| dc.subject.keywordAuthor | Lexical analysis | - |
| dc.subject.keywordAuthor | WordNet | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
