Cited 0 time in
Corporate Bankruptcy Prediction Model using Explainable AI-based Feature Selection
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | 문건두 | - |
| dc.contributor.author | 김경재 | - |
| dc.date.accessioned | 2024-08-08T08:01:29Z | - |
| dc.date.available | 2024-08-08T08:01:29Z | - |
| dc.date.issued | 2023-06 | - |
| dc.identifier.issn | 2288-4866 | - |
| dc.identifier.issn | 2288-4882 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/20164 | - |
| dc.description.abstract | A corporate insolvency prediction model serves as a vital tool for objectively monitoring the financial condition of companies. It enables timely warnings, facilitates responsive actions, and supports the formulation of effective management strategies to mitigate bankruptcy risks and enhance performance. Investors and financial institutions utilize default prediction models to minimize financial losses. As the interest in utilizing artificial intelligence (AI) technology for corporate insolvency prediction grows, extensive research has been conducted in this domain. However, there is an increasing demand for explainable AI models in corporate insolvency prediction, emphasizing interpretability and reliability. The SHAP (SHapley Additive exPlanations) technique has gained significant popularity and has demonstrated strong performance in various applications. Nonetheless, it has limitations such as computational cost, processing time, and scalability concerns based on the number of variables. This study introduces a novel approach to variable selection that reduces the number of variables by averaging SHAP values from bootstrapped data subsets instead of using the entire dataset. This technique aims to improve computational efficiency while maintaining excellent predictive performance. To obtain classification results, we aim to train random forest, XGBoost, and C5.0 models using carefully selected variables with high interpretability. The classification accuracy of the ensemble model, generated through soft voting as the goal of high-performance model design, is compared with the individual models. The study leverages data from 1,698 Korean light industrial companies and employs bootstrapping to create distinct data groups. Logistic Regression is employed to calculate SHAP values for each data group, and their averages are computed to derive the final SHAP values. The proposed model enhances interpretability and aims to achieve superior predictive performance. | - |
| dc.format.extent | 25 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | 한국지능정보시스템학회 | - |
| dc.title | Corporate Bankruptcy Prediction Model using Explainable AI-based Feature Selection | - |
| dc.title.alternative | Corporate Bankruptcy Prediction Model using Explainable AI-based Feature Selection | - |
| dc.type | Article | - |
| dc.publisher.location | 대한민국 | - |
| dc.identifier.doi | 10.13088/jiis.2023.29.2.241 | - |
| dc.identifier.bibliographicCitation | 지능정보연구, v.29, no.2, pp 241 - 265 | - |
| dc.citation.title | 지능정보연구 | - |
| dc.citation.volume | 29 | - |
| dc.citation.number | 2 | - |
| dc.citation.startPage | 241 | - |
| dc.citation.endPage | 265 | - |
| dc.identifier.kciid | ART002973026 | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | kci | - |
| dc.subject.keywordAuthor | Explainable AI | - |
| dc.subject.keywordAuthor | Corporate Bankruptcy Prediction | - |
| dc.subject.keywordAuthor | SHAP | - |
| dc.subject.keywordAuthor | Random Forest | - |
| dc.subject.keywordAuthor | XGBoost | - |
| dc.subject.keywordAuthor | 설명가능 인공지능 | - |
| dc.subject.keywordAuthor | 기업부실예측 | - |
| dc.subject.keywordAuthor | SHAP | - |
| dc.subject.keywordAuthor | Random Forest | - |
| dc.subject.keywordAuthor | XGBoost | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
