Detailed Information

Cited 0 time in webofscience Cited 1 time in scopus
Metadata Downloads

Diagnostic Accuracy and Clinical Value of a Domain-specific Multimodal Generative AI Model for Chest Radiograph Report Generation

Full metadata record
DC Field Value Language
dc.contributor.authorHong, Eun Kyoung-
dc.contributor.authorHam, Jiyeon-
dc.contributor.authorRoh, Byungseok-
dc.contributor.authorGu, Jawook-
dc.contributor.authorPark, Beomhee-
dc.contributor.authorKang, Sunghun-
dc.contributor.authorYou, Kihyun-
dc.contributor.authorEom, Jihwan-
dc.contributor.authorBae, Byeonguk-
dc.contributor.authorJo, Jae-Bock-
dc.contributor.authorSong, Ok Kyu-
dc.contributor.authorBae, Woong-
dc.contributor.authorLee, Ro Woon-
dc.contributor.authorSuh, Chong Hyun-
dc.contributor.authorPark, Chan Ho-
dc.contributor.authorChoi, Seong Jun-
dc.contributor.authorPark, Jai Soung-
dc.contributor.authorPark, Jae-Hyeong-
dc.contributor.authorJeon, Hyun Jeong-
dc.contributor.authorHong, Jeong-Ho-
dc.contributor.authorCho, Dosang-
dc.contributor.authorChoi, Han Seok-
dc.contributor.authorKim, Tae Hee-
dc.date.accessioned2025-04-15T01:30:15Z-
dc.date.available2025-04-15T01:30:15Z-
dc.date.issued2025-03-
dc.identifier.issn0033-8419-
dc.identifier.issn1527-1315-
dc.identifier.urihttps://scholarworks.dongguk.edu/handle/sw.dongguk/58229-
dc.description.abstractBackground Generative artificial intelligence (AI) is anticipated to alter radiology workflows, requiring a clinical value assessment for frequent examinations like chest radiograph interpretation. Purpose To develop and evaluate the diagnostic accuracy and clinical value of a domain-specific multimodal generative AI model for providing preliminary interpretations of chest radiographs. Materials and Methods For training, consecutive radiograph-report pairs from frontal chest radiography were retrospectively collected from 42 hospitals (2005-2023). The trained domain-specific AI model generated radiology reports for the radiographs. The test set included public datasets (PadChest, Open-i, VinDr-CXR, and MIMIC-CXR-JPG) and radiographs excluded from training. The sensitivity and specificity of the model-generated reports for 13 radiographic findings, compared with radiologist annotations (reference standard), were calculated (with 95% CIs). Four radiologists evaluated the subjective quality of the reports in terms of acceptability, agreement score, quality score, and comparative ranking of reports from (a) the domain-specific AI model, (b) radiologists, and (c) a general-purpose large language model (GPT-4Vision). Acceptability was defined as whether the radiologist would endorse the report as their own without changes. Agreement scores from 1 (clinically significant discrepancy) to 5 (complete agreement) were assigned using RADPEER; quality scores were on a 5-point Likert scale from 1 (very poor) to 5 (excellent). Results A total of 8 838 719 radiograph-report pairs (training) and 2145 radiographs (testing) were included (anonymized with respect to sex and gender). Reports generated by the domain-specific AI model demonstrated high sensitivity for detecting two critical radiographic findings: 95.3% (181 of 190) for pneumothorax and 92.6% (138 of 149) for subcutaneous emphysema. Acceptance rate, evaluated by four radiologists, was 70.5% (6047 of 8680), 73.3% (6288 of 8580), and 29.6% (2536 of 8580) for model-generated, radiologist, and GPT-4Vision reports, respectively. Agreement scores were highest for the model-generated reports (median = 4 [IQR, 3-5]) and lowest for GPT-4Vision reports (median = 1 [IQR, 1-3]; P < .001). Quality scores were also highest for the model-generated reports (median = 4 [IQR, 3-5]) and lowest for the GPT-4Vision reports (median = 2 [IQR, 1-3]; P < .001). From the ranking analysis, model-generated reports were most frequently ranked the highest (60.0%; 5146 of 8580), and GPT-4Vision reports were most frequently ranked the lowest (73.6%; 6312 of 8580). Conclusion A domain-specific multimodal generative AI model demonstrated potential for high diagnostic accuracy and clinical value in providing preliminary interpretations of chest radiographs for radiologists. © RSNA, 2025 Supplemental material is available for this article. See also the editorial by Little in this issue.-
dc.language영어-
dc.language.isoENG-
dc.publisherRadiological Society of North America-
dc.titleDiagnostic Accuracy and Clinical Value of a Domain-specific Multimodal Generative AI Model for Chest Radiograph Report Generation-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1148/radiol.241476-
dc.identifier.scopusid2-s2.0-105002001041-
dc.identifier.wosid001464548700007-
dc.identifier.bibliographicCitationRadiology, v.314, no.3-
dc.citation.titleRadiology-
dc.citation.volume314-
dc.citation.number3-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaRadiology, Nuclear Medicine & Medical Imaging-
dc.relation.journalWebOfScienceCategoryRadiology, Nuclear Medicine & Medical Imaging-
dc.subject.keywordAuthorPandas Version 2.1.1-
dc.subject.keywordAuthorScipy Version 1.11.3-
dc.subject.keywordAuthorAlgorithm-
dc.subject.keywordAuthorArea Under The Curve-
dc.subject.keywordAuthorArticle-
dc.subject.keywordAuthorArtificial Intelligence-
dc.subject.keywordAuthorAtelectasis-
dc.subject.keywordAuthorComputer Assisted Tomography-
dc.subject.keywordAuthorControlled Study-
dc.subject.keywordAuthorCross Validation-
dc.subject.keywordAuthorDeep Learning-
dc.subject.keywordAuthorDiagnostic Accuracy-
dc.subject.keywordAuthorDiagnostic Test Accuracy Study-
dc.subject.keywordAuthorFollow Up-
dc.subject.keywordAuthorFracture-
dc.subject.keywordAuthorHuman-
dc.subject.keywordAuthorHyperinflation-
dc.subject.keywordAuthorImage Quality-
dc.subject.keywordAuthorLung Edema-
dc.subject.keywordAuthorLung Lesion-
dc.subject.keywordAuthorMachine Learning-
dc.subject.keywordAuthorMulticenter Study-
dc.subject.keywordAuthorOutcome Assessment-
dc.subject.keywordAuthorPleura Effusion-
dc.subject.keywordAuthorPneumothorax-
dc.subject.keywordAuthorPrediction-
dc.subject.keywordAuthorPredictive Value-
dc.subject.keywordAuthorRadiologist-
dc.subject.keywordAuthorReceiver Operating Characteristic-
dc.subject.keywordAuthorRetrospective Study-
dc.subject.keywordAuthorSensitivity And Specificity-
dc.subject.keywordAuthorSubcutaneous Emphysema-
dc.subject.keywordAuthorThorax Radiography-
dc.subject.keywordAuthorTraining-
dc.subject.keywordAuthorAdult-
dc.subject.keywordAuthorAged-
dc.subject.keywordAuthorClinical Trial-
dc.subject.keywordAuthorComputer Assisted Diagnosis-
dc.subject.keywordAuthorFemale-
dc.subject.keywordAuthorMale-
dc.subject.keywordAuthorMiddle Aged-
dc.subject.keywordAuthorProcedures-
dc.subject.keywordAuthorReproducibility-
dc.subject.keywordAuthorAdult-
dc.subject.keywordAuthorAged-
dc.subject.keywordAuthorArtificial Intelligence-
dc.subject.keywordAuthorFemale-
dc.subject.keywordAuthorHumans-
dc.subject.keywordAuthorMale-
dc.subject.keywordAuthorMiddle Aged-
dc.subject.keywordAuthorRadiographic Image Interpretation, Computer-assisted-
dc.subject.keywordAuthorRadiography, Thoracic-
dc.subject.keywordAuthorReproducibility Of Results-
dc.subject.keywordAuthorRetrospective Studies-
dc.subject.keywordAuthorSensitivity And Specificity-
Files in This Item
There are no files associated with this item.
Appears in
Collections
Graduate School > Department of Medicine > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE