Cited 0 time in
Public values in public R&D through natural language processing
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Jang, Hyejin | - |
| dc.contributor.author | Roh, Taeyeon | - |
| dc.contributor.author | Yoon, Byungun | - |
| dc.date.accessioned | 2025-11-03T06:00:09Z | - |
| dc.date.available | 2025-11-03T06:00:09Z | - |
| dc.date.issued | 2025-10 | - |
| dc.identifier.issn | 2045-2322 | - |
| dc.identifier.issn | 2045-2322 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/61925 | - |
| dc.description.abstract | Given South Korea’s recent 16.6% reduction in research and development (R&D) budgets for 2023, there is an urgent need for more efficient and strategic R&D policy management. Previous studies evaluating R&D outputs have primarily relied on quantitative metrics or expert opinions, making it challenging to assess qualitative outcomes systematically. However, existing literature lacks data-driven, scalable approaches that go beyond counting outputs to systematically capture and interpret the broader societal and economic impacts of public R&D, particularly in emerging and converging technology fields. This study introduces a novel framework for evaluating the social value of public R&D outputs by focusing on public value (PV) as a critical metric. The approach integrates conventional R&D outputs with external data sources, leveraging advanced natural language processing and deep learning techniques. Specifically, NLP-based document parsing was applied to segment full-text patents into subsections (e.g., Background, Technical Problem, Advantageous Effects), and a large language model (LLM) classifier was used to categorize each segment into predefined public value types. Utilizing a comprehensive dataset of 1642 patents and 422 news columns, the patents were sourced from the National Technology Information Service database of publicly funded AI R&D projects and matched with full-text USPTO records, while news columns were selected from AI-related opinion pieces to capture societal perspectives. Focusing on patents ensures consistency but limits representativeness, classification depends on LLM performance, and PV subjectivity is addressed through our systematic framework. From these sources, the study extracted public value elements across six types: industrial advancements, safe society, sustainable environment, job creation, human health, and convenience of life. These six categories were defined based on national policy documents and technology impact assessments, and operationalized in this study by identifying and classifying text segments from patents and news columns that explicitly reflect each public value type. Our analysis focuses on the top five R&D programs, which were selected based on the highest patent counts among publicly funded AI-related programs, encompassing both educational/workforce development initiatives and key technology development projects. By integrating multiple data sources, including patents and news columns, with advanced NLP and LLM, this study demonstrates a novel approach to R&D policy evaluation that yields richer, evidence-based insights for policymakers and practitioners. The findings offer broader implications for public R&D policy by providing evidence-based insights that inform program prioritization, resource allocation, and the design of impact-oriented evaluation systems. In doing so, the study advances existing practices in R&D policy management by presenting a systematic, data-driven framework for evaluating the societal impacts of public R&D. © 2025 Elsevier B.V., All rights reserved. | - |
| dc.format.extent | 16 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Nature Portfolio | - |
| dc.title | Public values in public R&D through natural language processing | - |
| dc.type | Article | - |
| dc.publisher.location | 독일 | - |
| dc.identifier.doi | 10.1038/s41598-025-21312-y | - |
| dc.identifier.scopusid | 2-s2.0-105019765570 | - |
| dc.identifier.wosid | 001603727800032 | - |
| dc.identifier.bibliographicCitation | Scientific Reports, v.15, no.1, pp 1 - 16 | - |
| dc.citation.title | Scientific Reports | - |
| dc.citation.volume | 15 | - |
| dc.citation.number | 1 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 16 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Science & Technology - Other Topics | - |
| dc.relation.journalWebOfScienceCategory | Multidisciplinary Sciences | - |
| dc.subject.keywordPlus | PERFORMANCE | - |
| dc.subject.keywordPlus | INFORMATION | - |
| dc.subject.keywordPlus | PROGRAMS | - |
| dc.subject.keywordPlus | DESIGN | - |
| dc.subject.keywordAuthor | GPT | - |
| dc.subject.keywordAuthor | Natural language processing | - |
| dc.subject.keywordAuthor | Public R&D | - |
| dc.subject.keywordAuthor | Public value | - |
| dc.subject.keywordAuthor | R&D evaluation | - |
| dc.subject.keywordAuthor | R&D output | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
