Cited 1 time in
Diagnostic Accuracy and Clinical Value of a Domain-specific Multimodal Generative AI Model for Chest Radiograph Report Generation
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Hong, Eun Kyoung | - |
| dc.contributor.author | Ham, Jiyeon | - |
| dc.contributor.author | Roh, Byungseok | - |
| dc.contributor.author | Gu, Jawook | - |
| dc.contributor.author | Park, Beomhee | - |
| dc.contributor.author | Kang, Sunghun | - |
| dc.contributor.author | You, Kihyun | - |
| dc.contributor.author | Eom, Jihwan | - |
| dc.contributor.author | Bae, Byeonguk | - |
| dc.contributor.author | Jo, Jae-Bock | - |
| dc.contributor.author | Song, Ok Kyu | - |
| dc.contributor.author | Bae, Woong | - |
| dc.contributor.author | Lee, Ro Woon | - |
| dc.contributor.author | Suh, Chong Hyun | - |
| dc.contributor.author | Park, Chan Ho | - |
| dc.contributor.author | Choi, Seong Jun | - |
| dc.contributor.author | Park, Jai Soung | - |
| dc.contributor.author | Park, Jae-Hyeong | - |
| dc.contributor.author | Jeon, Hyun Jeong | - |
| dc.contributor.author | Hong, Jeong-Ho | - |
| dc.contributor.author | Cho, Dosang | - |
| dc.contributor.author | Choi, Han Seok | - |
| dc.contributor.author | Kim, Tae Hee | - |
| dc.date.accessioned | 2025-04-15T01:30:15Z | - |
| dc.date.available | 2025-04-15T01:30:15Z | - |
| dc.date.issued | 2025-03 | - |
| dc.identifier.issn | 0033-8419 | - |
| dc.identifier.issn | 1527-1315 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/58229 | - |
| dc.description.abstract | Background Generative artificial intelligence (AI) is anticipated to alter radiology workflows, requiring a clinical value assessment for frequent examinations like chest radiograph interpretation. Purpose To develop and evaluate the diagnostic accuracy and clinical value of a domain-specific multimodal generative AI model for providing preliminary interpretations of chest radiographs. Materials and Methods For training, consecutive radiograph-report pairs from frontal chest radiography were retrospectively collected from 42 hospitals (2005-2023). The trained domain-specific AI model generated radiology reports for the radiographs. The test set included public datasets (PadChest, Open-i, VinDr-CXR, and MIMIC-CXR-JPG) and radiographs excluded from training. The sensitivity and specificity of the model-generated reports for 13 radiographic findings, compared with radiologist annotations (reference standard), were calculated (with 95% CIs). Four radiologists evaluated the subjective quality of the reports in terms of acceptability, agreement score, quality score, and comparative ranking of reports from (a) the domain-specific AI model, (b) radiologists, and (c) a general-purpose large language model (GPT-4Vision). Acceptability was defined as whether the radiologist would endorse the report as their own without changes. Agreement scores from 1 (clinically significant discrepancy) to 5 (complete agreement) were assigned using RADPEER; quality scores were on a 5-point Likert scale from 1 (very poor) to 5 (excellent). Results A total of 8 838 719 radiograph-report pairs (training) and 2145 radiographs (testing) were included (anonymized with respect to sex and gender). Reports generated by the domain-specific AI model demonstrated high sensitivity for detecting two critical radiographic findings: 95.3% (181 of 190) for pneumothorax and 92.6% (138 of 149) for subcutaneous emphysema. Acceptance rate, evaluated by four radiologists, was 70.5% (6047 of 8680), 73.3% (6288 of 8580), and 29.6% (2536 of 8580) for model-generated, radiologist, and GPT-4Vision reports, respectively. Agreement scores were highest for the model-generated reports (median = 4 [IQR, 3-5]) and lowest for GPT-4Vision reports (median = 1 [IQR, 1-3]; P < .001). Quality scores were also highest for the model-generated reports (median = 4 [IQR, 3-5]) and lowest for the GPT-4Vision reports (median = 2 [IQR, 1-3]; P < .001). From the ranking analysis, model-generated reports were most frequently ranked the highest (60.0%; 5146 of 8580), and GPT-4Vision reports were most frequently ranked the lowest (73.6%; 6312 of 8580). Conclusion A domain-specific multimodal generative AI model demonstrated potential for high diagnostic accuracy and clinical value in providing preliminary interpretations of chest radiographs for radiologists. © RSNA, 2025 Supplemental material is available for this article. See also the editorial by Little in this issue. | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Radiological Society of North America | - |
| dc.title | Diagnostic Accuracy and Clinical Value of a Domain-specific Multimodal Generative AI Model for Chest Radiograph Report Generation | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1148/radiol.241476 | - |
| dc.identifier.scopusid | 2-s2.0-105002001041 | - |
| dc.identifier.wosid | 001464548700007 | - |
| dc.identifier.bibliographicCitation | Radiology, v.314, no.3 | - |
| dc.citation.title | Radiology | - |
| dc.citation.volume | 314 | - |
| dc.citation.number | 3 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Radiology, Nuclear Medicine & Medical Imaging | - |
| dc.relation.journalWebOfScienceCategory | Radiology, Nuclear Medicine & Medical Imaging | - |
| dc.subject.keywordAuthor | Pandas Version 2.1.1 | - |
| dc.subject.keywordAuthor | Scipy Version 1.11.3 | - |
| dc.subject.keywordAuthor | Algorithm | - |
| dc.subject.keywordAuthor | Area Under The Curve | - |
| dc.subject.keywordAuthor | Article | - |
| dc.subject.keywordAuthor | Artificial Intelligence | - |
| dc.subject.keywordAuthor | Atelectasis | - |
| dc.subject.keywordAuthor | Computer Assisted Tomography | - |
| dc.subject.keywordAuthor | Controlled Study | - |
| dc.subject.keywordAuthor | Cross Validation | - |
| dc.subject.keywordAuthor | Deep Learning | - |
| dc.subject.keywordAuthor | Diagnostic Accuracy | - |
| dc.subject.keywordAuthor | Diagnostic Test Accuracy Study | - |
| dc.subject.keywordAuthor | Follow Up | - |
| dc.subject.keywordAuthor | Fracture | - |
| dc.subject.keywordAuthor | Human | - |
| dc.subject.keywordAuthor | Hyperinflation | - |
| dc.subject.keywordAuthor | Image Quality | - |
| dc.subject.keywordAuthor | Lung Edema | - |
| dc.subject.keywordAuthor | Lung Lesion | - |
| dc.subject.keywordAuthor | Machine Learning | - |
| dc.subject.keywordAuthor | Multicenter Study | - |
| dc.subject.keywordAuthor | Outcome Assessment | - |
| dc.subject.keywordAuthor | Pleura Effusion | - |
| dc.subject.keywordAuthor | Pneumothorax | - |
| dc.subject.keywordAuthor | Prediction | - |
| dc.subject.keywordAuthor | Predictive Value | - |
| dc.subject.keywordAuthor | Radiologist | - |
| dc.subject.keywordAuthor | Receiver Operating Characteristic | - |
| dc.subject.keywordAuthor | Retrospective Study | - |
| dc.subject.keywordAuthor | Sensitivity And Specificity | - |
| dc.subject.keywordAuthor | Subcutaneous Emphysema | - |
| dc.subject.keywordAuthor | Thorax Radiography | - |
| dc.subject.keywordAuthor | Training | - |
| dc.subject.keywordAuthor | Adult | - |
| dc.subject.keywordAuthor | Aged | - |
| dc.subject.keywordAuthor | Clinical Trial | - |
| dc.subject.keywordAuthor | Computer Assisted Diagnosis | - |
| dc.subject.keywordAuthor | Female | - |
| dc.subject.keywordAuthor | Male | - |
| dc.subject.keywordAuthor | Middle Aged | - |
| dc.subject.keywordAuthor | Procedures | - |
| dc.subject.keywordAuthor | Reproducibility | - |
| dc.subject.keywordAuthor | Adult | - |
| dc.subject.keywordAuthor | Aged | - |
| dc.subject.keywordAuthor | Artificial Intelligence | - |
| dc.subject.keywordAuthor | Female | - |
| dc.subject.keywordAuthor | Humans | - |
| dc.subject.keywordAuthor | Male | - |
| dc.subject.keywordAuthor | Middle Aged | - |
| dc.subject.keywordAuthor | Radiographic Image Interpretation, Computer-assisted | - |
| dc.subject.keywordAuthor | Radiography, Thoracic | - |
| dc.subject.keywordAuthor | Reproducibility Of Results | - |
| dc.subject.keywordAuthor | Retrospective Studies | - |
| dc.subject.keywordAuthor | Sensitivity And Specificity | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
