연관성 기반 비유사성을 활용한 범주형 자료 군집분석Categorical Data Clustering Analysis Using Association-based Dissimilarity
- Other Titles
- Categorical Data Clustering Analysis Using Association-based Dissimilarity
- Authors
- 이창기; 정욱
- Issue Date
- Jun-2019
- Publisher
- 한국품질경영학회
- Keywords
- Association-based Dissimilarity; Distance Metric; Unsupervised Learning; Categorical Data; Clustering
- Citation
- 품질경영학회지, v.47, no.2, pp 271 - 281
- Pages
- 11
- Indexed
- KCI
- Journal Title
- 품질경영학회지
- Volume
- 47
- Number
- 2
- Start Page
- 271
- End Page
- 281
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/8005
- DOI
- 10.7469/JKSQM.2019.47.2.271
- ISSN
- 1229-1889
2287-9005
- Abstract
- Purpose: The purpose of this study is to suggest a more efficient distance measure taking into account the relationship between categorical variables for categorical data cluster analysis.
Methods: In this study, the association-based dissimilarity was employed to calculate the distance between two categorical data observations and the distance obtained from the association-based dissimilarity was applied to the PAM cluster algorithms to verify its effectiveness. The strength of association between two different categorical variables can be calculated using a mixture of dissimilarities between the conditional probability distributions of other categorical variables, given these two categorical values. In particular, this method is suitable for datasets whose categorical variables are highly correlated.
Results: The simulation results using several real life data showed that the proposed distance which considered relationships among the categorical variables generally yielded better clustering performance than the Hamming distance. In addition, as the number of correlated variables was increasing, the difference in the performance of the two clustering methods based on different distance measures became statistically more significant.
Conclusion: This study revealed that the adoption of the relationship between categorical variables using our proposed method positively affected the results of cluster analysis.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - Dongguk Business School > Department of Business Administration > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.