Predicting Breast Cancer from Risk Factors Using SVM and Extra-Trees-Based Feature Selection Methodopen access
- Authors
- Alfian, Ganjar; Syafrudin, Muhammad; Fahrurrozi, Imam; Fitriyani, Norma Latif; Atmaji, Fransiskus Tatas Dwi; Widodo, Tri; Bahiyah, Nurul; Benes, Filip; Rhee, Jongtae
- Issue Date
- Sep-2022
- Publisher
- MDPI
- Keywords
- breast cancer; support vector machine; extra-trees; risk factors
- Citation
- Computers, v.11, no.9, pp 1 - 14
- Pages
- 14
- Indexed
- SCOPUS
ESCI
- Journal Title
- Computers
- Volume
- 11
- Number
- 9
- Start Page
- 1
- End Page
- 14
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/2619
- DOI
- 10.3390/computers11090136
- ISSN
- 2073-431X
2073-431X
- Abstract
- Developing a prediction model from risk factors can provide an efficient method to recognize breast cancer. Machine learning (ML) algorithms have been applied to increase the efficiency of diagnosis at the early stage. This paper studies a support vector machine (SVM) combined with an extremely randomized trees classifier (extra-trees) to provide a diagnosis of breast cancer at the early stage based on risk factors. The extra-trees classifier was used to remove irrelevant features, while SVM was utilized to diagnose the breast cancer status. A breast cancer dataset consisting of 116 subjects was utilized by machine learning models to predict breast cancer, while the stratified 10-fold cross-validation was employed for the model evaluation. Our proposed combined SVM and extra-trees model reached the highest accuracy up to 80.23%, which was significantly better than the other ML model. The experimental results demonstrated that by applying extra-trees-based feature selection, the average ML prediction accuracy was improved by up to 7.29% as contrasted to ML without the feature selection method. Our proposed model is expected to increase the efficiency of breast cancer diagnosis based on risk factors. In addition, we presented the proposed prediction model that could be employed for web-based breast cancer prediction. The proposed model is expected to improve diagnostic decision-support systems by predicting breast cancer disease accurately.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Engineering > Department of Industrial and Systems Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.