Cited 5 time in
Chronic Disease Prediction Model Using Integration of DBSCAN, SMOTE-ENN, and Random Forest
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Norma Latif Fitriyani | - |
| dc.contributor.author | Muhammad Syafrudin | - |
| dc.contributor.author | Ganjar Alfian | - |
| dc.contributor.author | Yang, Chuan-kai | - |
| dc.contributor.author | Rhee, Jongtae | - |
| dc.contributor.author | Siti Maghfirotul Ulyah | - |
| dc.date.accessioned | 2023-04-27T13:41:12Z | - |
| dc.date.available | 2023-04-27T13:41:12Z | - |
| dc.date.issued | 2022 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/3835 | - |
| dc.description.abstract | Heart disease (HD) is number one chronic disease and becomes a major cause of worldwide disability and death. Aside of HD, type 2 diabetes (T2D) is also as the most deathful diseases that causes serious issues if untreated and undetected. HD and T2D predictions are the most effective measures to control the HD and T2D. Thus, early HD and T2D predictions are important to help individuals in preventing the occurrence of the worst cases. This study proposes a chronic disease prediction model for HD and T2D prediction. The proposed study utilized random forest combined with DBSCAN as outlier detection method and SMOTE-ENN as data balancing method. Two HD datasets (Statlog and Cleveland) and one T2D dataset (NHIS Korea) were used for building the model and comparing the results with other existing machine learning (ML) algorithms, including GNB, LR, MLP, DT, and SVM. To measure the performance of the model, k-fold (10) cross-validation and several performance metrics including accuracy, precision, f-measure, and recall are applied in this study. The results show the model that we proposed outperforms other classification models, as well as previous studies, with accuracy rates 97.63%, 97.69%, and 94.85% for Statlog HD dataset, Cleveland HD dataset and NHIS T2D dataset, respectively. By utilizing the proposed model, it could increase the expectation in preventing the occurrence of the worst case and helping individuals in taking fast and precise actions when status of HD and T2D are detected. © 2022 IEEE. | - |
| dc.format.extent | 6 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | IEEE | - |
| dc.title | Chronic Disease Prediction Model Using Integration of DBSCAN, SMOTE-ENN, and Random Forest | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1109/ICETSIS55481.2022.9888806 | - |
| dc.identifier.scopusid | 2-s2.0-85140914955 | - |
| dc.identifier.bibliographicCitation | 2022 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems (ICETSIS), pp 289 - 294 | - |
| dc.citation.title | 2022 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems (ICETSIS) | - |
| dc.citation.startPage | 289 | - |
| dc.citation.endPage | 294 | - |
| dc.type.docType | Conference Paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | foreign | - |
| dc.subject.keywordAuthor | heart disease | - |
| dc.subject.keywordAuthor | machine learning | - |
| dc.subject.keywordAuthor | outlier | - |
| dc.subject.keywordAuthor | type 2 diabetes | - |
| dc.subject.keywordAuthor | unbalanced data | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
