Detailed Information

Cited 25 time in webofscience Cited 33 time in scopus
Metadata Downloads

Adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique algorithm for tackling binary imbalanced datasets in biomedical data classificationopen access

Authors
Li, JinyanFong, SimonSung, YunsickCho, KyungeunWong, RaymondWong, Kelvin K. L.
Issue Date
1-Dec-2016
Publisher
BMC
Keywords
Imbalanced dataset; Swarm optimisation; Under-sampling; SMOTE; Dynamic Multi-objective; Classification; Biomedical data
Citation
BIODATA MINING, v.9, no.1, pp 1 - 15
Pages
15
Indexed
SCIE
SCOPUS
Journal Title
BIODATA MINING
Volume
9
Number
1
Start Page
1
End Page
15
URI
https://scholarworks.dongguk.edu/handle/sw.dongguk/18905
DOI
10.1186/s13040-016-0117-1
ISSN
1756-0381
Abstract
Background: An imbalanced dataset is defined as a training dataset that has imbalanced proportions of data in both interesting and uninteresting classes. Often in biomedical applications, samples from the stimulating class are rare in a population, such as medical anomalies, positive clinical tests, and particular diseases. Although the target samples in the primitive dataset are small in number, the induction of a classification model over such training data leads to poor prediction performance due to insufficient training from the minority class. Results: In this paper, we use a novel class-balancing method named adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique (ASCB_DmSMOTE) to solve this imbalanced dataset problem, which is common in biomedical applications. The proposed method combines undersampling and over-sampling into a swarm optimisation algorithm. It adaptively selects suitable parameters for the rebalancing algorithm to find the best solution. Compared with the other versions of the SMOTE algorithm, significant improvements, which include higher accuracy and credibility, are observed with ASCB_DmSMOTE. Conclusions: Our proposed method tactfully combines two rebalancing techniques together. It reasonably re-allocates the majority class in the details and dynamically optimises the two parameters of SMOTE to synthesise a reasonable scale of minority class for each clustered sub-imbalanced dataset. The proposed methods ultimately overcome other conventional methods and attains higher credibility with even greater accuracy of the classification model.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Cho, Kyung Eun photo

Cho, Kyung Eun
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE