Diagnostic Accuracy and Clinical Value of a Domain-specific Multimodal Generative AI Model for Chest Radiograph Report Generation

Hong, Eun Kyoung; Ham, Jiyeon; Roh, Byungseok; Gu, Jawook; Park, Beomhee; Kang, Sunghun; You, Kihyun; Eom, Jihwan; Bae, Byeonguk; Jo, Jae-Bock; Song, Ok Kyu; Bae, Woong; Lee, Ro Woon; Suh, Chong Hyun; Park, Chan Ho; Choi, Seong Jun; Park, Jai Soung; Park, Jae-Hyeong; Jeon, Hyun Jeong; Hong, Jeong-Ho; Cho, Dosang; Choi, Han Seok; Kim, Tae Hee

Detailed Information

Cited 0 time in webofscience

Cited 1 time in scopus

Metadata Downloads

Diagnostic Accuracy and Clinical Value of a Domain-specific Multimodal Generative AI Model for Chest Radiograph Report Generation

Full metadata record

DC Field	Value	Language
dc.contributor.author	Hong, Eun Kyoung	-
dc.contributor.author	Ham, Jiyeon	-
dc.contributor.author	Roh, Byungseok	-
dc.contributor.author	Gu, Jawook	-
dc.contributor.author	Park, Beomhee	-
dc.contributor.author	Kang, Sunghun	-
dc.contributor.author	You, Kihyun	-
dc.contributor.author	Eom, Jihwan	-
dc.contributor.author	Bae, Byeonguk	-
dc.contributor.author	Jo, Jae-Bock	-
dc.contributor.author	Song, Ok Kyu	-
dc.contributor.author	Bae, Woong	-
dc.contributor.author	Lee, Ro Woon	-
dc.contributor.author	Suh, Chong Hyun	-
dc.contributor.author	Park, Chan Ho	-
dc.contributor.author	Choi, Seong Jun	-
dc.contributor.author	Park, Jai Soung	-
dc.contributor.author	Park, Jae-Hyeong	-
dc.contributor.author	Jeon, Hyun Jeong	-
dc.contributor.author	Hong, Jeong-Ho	-
dc.contributor.author	Cho, Dosang	-
dc.contributor.author	Choi, Han Seok	-
dc.contributor.author	Kim, Tae Hee	-
dc.date.accessioned	2025-04-15T01:30:15Z	-
dc.date.available	2025-04-15T01:30:15Z	-
dc.date.issued	2025-03	-
dc.identifier.issn	0033-8419	-
dc.identifier.issn	1527-1315	-
dc.identifier.uri	https://scholarworks.dongguk.edu/handle/sw.dongguk/58229	-
dc.description.abstract	Background Generative artificial intelligence (AI) is anticipated to alter radiology workflows, requiring a clinical value assessment for frequent examinations like chest radiograph interpretation. Purpose To develop and evaluate the diagnostic accuracy and clinical value of a domain-specific multimodal generative AI model for providing preliminary interpretations of chest radiographs. Materials and Methods For training, consecutive radiograph-report pairs from frontal chest radiography were retrospectively collected from 42 hospitals (2005-2023). The trained domain-specific AI model generated radiology reports for the radiographs. The test set included public datasets (PadChest, Open-i, VinDr-CXR, and MIMIC-CXR-JPG) and radiographs excluded from training. The sensitivity and specificity of the model-generated reports for 13 radiographic findings, compared with radiologist annotations (reference standard), were calculated (with 95% CIs). Four radiologists evaluated the subjective quality of the reports in terms of acceptability, agreement score, quality score, and comparative ranking of reports from (a) the domain-specific AI model, (b) radiologists, and (c) a general-purpose large language model (GPT-4Vision). Acceptability was defined as whether the radiologist would endorse the report as their own without changes. Agreement scores from 1 (clinically significant discrepancy) to 5 (complete agreement) were assigned using RADPEER; quality scores were on a 5-point Likert scale from 1 (very poor) to 5 (excellent). Results A total of 8 838 719 radiograph-report pairs (training) and 2145 radiographs (testing) were included (anonymized with respect to sex and gender). Reports generated by the domain-specific AI model demonstrated high sensitivity for detecting two critical radiographic findings: 95.3% (181 of 190) for pneumothorax and 92.6% (138 of 149) for subcutaneous emphysema. Acceptance rate, evaluated by four radiologists, was 70.5% (6047 of 8680), 73.3% (6288 of 8580), and 29.6% (2536 of 8580) for model-generated, radiologist, and GPT-4Vision reports, respectively. Agreement scores were highest for the model-generated reports (median = 4 [IQR, 3-5]) and lowest for GPT-4Vision reports (median = 1 [IQR, 1-3]; P < .001). Quality scores were also highest for the model-generated reports (median = 4 [IQR, 3-5]) and lowest for the GPT-4Vision reports (median = 2 [IQR, 1-3]; P < .001). From the ranking analysis, model-generated reports were most frequently ranked the highest (60.0%; 5146 of 8580), and GPT-4Vision reports were most frequently ranked the lowest (73.6%; 6312 of 8580). Conclusion A domain-specific multimodal generative AI model demonstrated potential for high diagnostic accuracy and clinical value in providing preliminary interpretations of chest radiographs for radiologists. © RSNA, 2025 Supplemental material is available for this article. See also the editorial by Little in this issue.	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Radiological Society of North America	-
dc.title	Diagnostic Accuracy and Clinical Value of a Domain-specific Multimodal Generative AI Model for Chest Radiograph Report Generation	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1148/radiol.241476	-
dc.identifier.scopusid	2-s2.0-105002001041	-
dc.identifier.wosid	001464548700007	-
dc.identifier.bibliographicCitation	Radiology, v.314, no.3	-
dc.citation.title	Radiology	-
dc.citation.volume	314	-
dc.citation.number	3	-
dc.type.docType	Article	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Radiology, Nuclear Medicine & Medical Imaging	-
dc.relation.journalWebOfScienceCategory	Radiology, Nuclear Medicine & Medical Imaging	-
dc.subject.keywordAuthor	Pandas Version 2.1.1	-
dc.subject.keywordAuthor	Scipy Version 1.11.3	-
dc.subject.keywordAuthor	Algorithm	-
dc.subject.keywordAuthor	Area Under The Curve	-
dc.subject.keywordAuthor	Article	-
dc.subject.keywordAuthor	Artificial Intelligence	-
dc.subject.keywordAuthor	Atelectasis	-
dc.subject.keywordAuthor	Computer Assisted Tomography	-
dc.subject.keywordAuthor	Controlled Study	-
dc.subject.keywordAuthor	Cross Validation	-
dc.subject.keywordAuthor	Deep Learning	-
dc.subject.keywordAuthor	Diagnostic Accuracy	-
dc.subject.keywordAuthor	Diagnostic Test Accuracy Study	-
dc.subject.keywordAuthor	Follow Up	-
dc.subject.keywordAuthor	Fracture	-
dc.subject.keywordAuthor	Human	-
dc.subject.keywordAuthor	Hyperinflation	-
dc.subject.keywordAuthor	Image Quality	-
dc.subject.keywordAuthor	Lung Edema	-
dc.subject.keywordAuthor	Lung Lesion	-
dc.subject.keywordAuthor	Machine Learning	-
dc.subject.keywordAuthor	Multicenter Study	-
dc.subject.keywordAuthor	Outcome Assessment	-
dc.subject.keywordAuthor	Pleura Effusion	-
dc.subject.keywordAuthor	Pneumothorax	-
dc.subject.keywordAuthor	Prediction	-
dc.subject.keywordAuthor	Predictive Value	-
dc.subject.keywordAuthor	Radiologist	-
dc.subject.keywordAuthor	Receiver Operating Characteristic	-
dc.subject.keywordAuthor	Retrospective Study	-
dc.subject.keywordAuthor	Sensitivity And Specificity	-
dc.subject.keywordAuthor	Subcutaneous Emphysema	-
dc.subject.keywordAuthor	Thorax Radiography	-
dc.subject.keywordAuthor	Training	-
dc.subject.keywordAuthor	Adult	-
dc.subject.keywordAuthor	Aged	-
dc.subject.keywordAuthor	Clinical Trial	-
dc.subject.keywordAuthor	Computer Assisted Diagnosis	-
dc.subject.keywordAuthor	Female	-
dc.subject.keywordAuthor	Male	-
dc.subject.keywordAuthor	Middle Aged	-
dc.subject.keywordAuthor	Procedures	-
dc.subject.keywordAuthor	Reproducibility	-
dc.subject.keywordAuthor	Adult	-
dc.subject.keywordAuthor	Aged	-
dc.subject.keywordAuthor	Artificial Intelligence	-
dc.subject.keywordAuthor	Female	-
dc.subject.keywordAuthor	Humans	-
dc.subject.keywordAuthor	Male	-
dc.subject.keywordAuthor	Middle Aged	-
dc.subject.keywordAuthor	Radiographic Image Interpretation, Computer-assisted	-
dc.subject.keywordAuthor	Radiography, Thoracic	-
dc.subject.keywordAuthor	Reproducibility Of Results	-
dc.subject.keywordAuthor	Retrospective Studies	-
dc.subject.keywordAuthor	Sensitivity And Specificity	-

Files in This Item: There are no files associated with this item.

Appears in Collections: Graduate School > Department of Medicine > 1. Journal Articles

Show simple item record

qrcode

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE