Cited 220 time in
Data-Driven Cervical Cancer Prediction Model with Outlier Detection and Over-Sampling Methods
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Ijaz, Muhammad Fazal | - |
| dc.contributor.author | Attique, Muhammad | - |
| dc.contributor.author | Son, Youngdoo | - |
| dc.date.accessioned | 2023-04-27T23:40:41Z | - |
| dc.date.available | 2023-04-27T23:40:41Z | - |
| dc.date.issued | 2020-05 | - |
| dc.identifier.issn | 1424-8220 | - |
| dc.identifier.issn | 1424-3210 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/6671 | - |
| dc.description.abstract | Globally, cervical cancer remains as the foremost prevailing cancer in females. Hence, it is necessary to distinguish the importance of risk factors of cervical cancer to classify potential patients. The present work proposes a cervical cancer prediction model (CCPM) that offers early prediction of cervical cancer using risk factors as inputs. The CCPM first removes outliers by using outlier detection methods such as density-based spatial clustering of applications with noise (DBSCAN) and isolation forest (iForest) and by increasing the number of cases in the dataset in a balanced way, for example, through synthetic minority over-sampling technique (SMOTE) and SMOTE with Tomek link (SMOTETomek). Finally, it employs random forest (RF) as a classifier. Thus, CCPM lies on four scenarios: (1) DBSCAN + SMOTETomek + RF, (2) DBSCAN + SMOTE+ RF, (3) iForest + SMOTETomek + RF, and (4) iForest + SMOTE + RF. A dataset of 858 potential patients was used to validate the performance of the proposed method. We found that combinations of iForest with SMOTE and iForest with SMOTETomek provided better performances than those of DBSCAN with SMOTE and DBSCAN with SMOTETomek. We also observed that RF performed the best among several popular machine learning classifiers. Furthermore, the proposed CCPM showed better accuracy than previously proposed methods for forecasting cervical cancer. In addition, a mobile application that can collect cervical cancer risk factors data and provides results from CCPM is developed for instant and proper action at the initial stage of cervical cancer. | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | MDPI | - |
| dc.title | Data-Driven Cervical Cancer Prediction Model with Outlier Detection and Over-Sampling Methods | - |
| dc.type | Article | - |
| dc.publisher.location | 스위스 | - |
| dc.identifier.doi | 10.3390/s20102809 | - |
| dc.identifier.scopusid | 2-s2.0-85084964723 | - |
| dc.identifier.wosid | 000539323700063 | - |
| dc.identifier.bibliographicCitation | SENSORS, v.20, no.10 | - |
| dc.citation.title | SENSORS | - |
| dc.citation.volume | 20 | - |
| dc.citation.number | 10 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Chemistry | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Instruments & Instrumentation | - |
| dc.relation.journalWebOfScienceCategory | Chemistry, Analytical | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.relation.journalWebOfScienceCategory | Instruments & Instrumentation | - |
| dc.subject.keywordPlus | DIFFUSION-WEIGHTED MRI | - |
| dc.subject.keywordPlus | FEATURE-SELECTION | - |
| dc.subject.keywordPlus | PARKINSONS-DISEASE | - |
| dc.subject.keywordPlus | DIAGNOSIS | - |
| dc.subject.keywordPlus | RISK | - |
| dc.subject.keywordPlus | CLASSIFICATION | - |
| dc.subject.keywordPlus | NETWORKS | - |
| dc.subject.keywordPlus | CYTOLOGY | - |
| dc.subject.keywordPlus | SMOTE | - |
| dc.subject.keywordAuthor | cancer | - |
| dc.subject.keywordAuthor | artificial intelligence | - |
| dc.subject.keywordAuthor | digital health | - |
| dc.subject.keywordAuthor | machine learning | - |
| dc.subject.keywordAuthor | medical information systems | - |
| dc.subject.keywordAuthor | cervical cancer | - |
| dc.subject.keywordAuthor | imbalanced data analysis | - |
| dc.subject.keywordAuthor | outlier detection | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
