Cited 0 time in
AutoSCAN: automatic detection of DBSCAN parameters and efficient clustering of data in overlapping density regions
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Bushra, Adil Abdu | - |
| dc.contributor.author | Kim, Dongyeon | - |
| dc.contributor.author | Kan, Yejin | - |
| dc.contributor.author | Yi, Gangman | - |
| dc.date.accessioned | 2024-08-08T11:02:02Z | - |
| dc.date.available | 2024-08-08T11:02:02Z | - |
| dc.date.issued | 2024-03 | - |
| dc.identifier.issn | 2376-5992 | - |
| dc.identifier.issn | 2376-5992 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/21656 | - |
| dc.description.abstract | The density-based clustering method is considered a robust approach in unsupervised clustering technique due to its ability to identify outliers, form clusters of irregular shapes and automatically determine the number of clusters. These unique properties helped its pioneering algorithm, the Density-based Spatial Clustering on Applications with Noise (DBSCAN), become applicable in datasets where various number of clusters of different shapes and sizes could be detected without much interference from the user. However, the original algorithm exhibits limitations, especially towards its sensitivity on its user input parameters minPts and 8. Additionally, the algorithm assigned inconsistent cluster labels to data objects found in overlapping density regions of separate clusters, hence lowering its accuracy. To alleviate these specific problems and increase the clustering accuracy, we propose two methods that use the statistical data from a given dataset's k-nearest neighbor density distribution in order to determine the optimal 8 values. Our approach removes the burden on the users, and automatically detects the clusters of a given dataset. Furthermore, a method to identify the accurate border objects of separate clusters is proposed and implemented to solve the unpredictability of the original algorithm. Finally, in our experiments, we show that our efficient re-implementation of the original algorithm to automatically cluster datasets and improve the clustering quality of adjoining cluster members provides increase in clustering accuracy and faster running times when compared to earlier approaches. | - |
| dc.format.extent | 31 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | PeerJ Inc. | - |
| dc.title | AutoSCAN: automatic detection of DBSCAN parameters and efficient clustering of data in overlapping density regions | - |
| dc.type | Article | - |
| dc.publisher.location | 영국 | - |
| dc.identifier.doi | 10.7717/peerj-cs.1921 | - |
| dc.identifier.scopusid | 2-s2.0-85190402462 | - |
| dc.identifier.wosid | 001186533700002 | - |
| dc.identifier.bibliographicCitation | PeerJ Computer Science, v.10, pp 1 - 31 | - |
| dc.citation.title | PeerJ Computer Science | - |
| dc.citation.volume | 10 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 31 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | - |
| dc.subject.keywordAuthor | DBSCAN | - |
| dc.subject.keywordAuthor | Density-based clustering | - |
| dc.subject.keywordAuthor | Unsupervised clustering | - |
| dc.subject.keywordAuthor | K-nearest neighbors | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
