Cited 0 time in
연관성 기반 비유사성을 활용한 범주형 자료 군집분석
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | 이창기 | - |
| dc.contributor.author | 정욱 | - |
| dc.date.accessioned | 2023-04-28T03:41:00Z | - |
| dc.date.available | 2023-04-28T03:41:00Z | - |
| dc.date.issued | 2019-06 | - |
| dc.identifier.issn | 1229-1889 | - |
| dc.identifier.issn | 2287-9005 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/8005 | - |
| dc.description.abstract | Purpose: The purpose of this study is to suggest a more efficient distance measure taking into account the relationship between categorical variables for categorical data cluster analysis. Methods: In this study, the association-based dissimilarity was employed to calculate the distance between two categorical data observations and the distance obtained from the association-based dissimilarity was applied to the PAM cluster algorithms to verify its effectiveness. The strength of association between two different categorical variables can be calculated using a mixture of dissimilarities between the conditional probability distributions of other categorical variables, given these two categorical values. In particular, this method is suitable for datasets whose categorical variables are highly correlated. Results: The simulation results using several real life data showed that the proposed distance which considered relationships among the categorical variables generally yielded better clustering performance than the Hamming distance. In addition, as the number of correlated variables was increasing, the difference in the performance of the two clustering methods based on different distance measures became statistically more significant. Conclusion: This study revealed that the adoption of the relationship between categorical variables using our proposed method positively affected the results of cluster analysis. | - |
| dc.format.extent | 11 | - |
| dc.language | 한국어 | - |
| dc.language.iso | KOR | - |
| dc.publisher | 한국품질경영학회 | - |
| dc.title | 연관성 기반 비유사성을 활용한 범주형 자료 군집분석 | - |
| dc.title.alternative | Categorical Data Clustering Analysis Using Association-based Dissimilarity | - |
| dc.type | Article | - |
| dc.publisher.location | 대한민국 | - |
| dc.identifier.doi | 10.7469/JKSQM.2019.47.2.271 | - |
| dc.identifier.bibliographicCitation | 품질경영학회지, v.47, no.2, pp 271 - 281 | - |
| dc.citation.title | 품질경영학회지 | - |
| dc.citation.volume | 47 | - |
| dc.citation.number | 2 | - |
| dc.citation.startPage | 271 | - |
| dc.citation.endPage | 281 | - |
| dc.identifier.kciid | ART002473573 | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | kci | - |
| dc.subject.keywordAuthor | Association-based Dissimilarity | - |
| dc.subject.keywordAuthor | Distance Metric | - |
| dc.subject.keywordAuthor | Unsupervised Learning | - |
| dc.subject.keywordAuthor | Categorical Data | - |
| dc.subject.keywordAuthor | Clustering | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
