Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

연관성 기반 비유사성을 활용한 범주형 자료 군집분석Categorical Data Clustering Analysis Using Association-based Dissimilarity

Other Titles
Categorical Data Clustering Analysis Using Association-based Dissimilarity
Authors
이창기정욱
Issue Date
Jun-2019
Publisher
한국품질경영학회
Keywords
Association-based Dissimilarity; Distance Metric; Unsupervised Learning; Categorical Data; Clustering
Citation
품질경영학회지, v.47, no.2, pp 271 - 281
Pages
11
Indexed
KCI
Journal Title
품질경영학회지
Volume
47
Number
2
Start Page
271
End Page
281
URI
https://scholarworks.dongguk.edu/handle/sw.dongguk/8005
DOI
10.7469/JKSQM.2019.47.2.271
ISSN
1229-1889
2287-9005
Abstract
Purpose: The purpose of this study is to suggest a more efficient distance measure taking into account the relationship between categorical variables for categorical data cluster analysis. Methods: In this study, the association-based dissimilarity was employed to calculate the distance between two categorical data observations and the distance obtained from the association-based dissimilarity was applied to the PAM cluster algorithms to verify its effectiveness. The strength of association between two different categorical variables can be calculated using a mixture of dissimilarities between the conditional probability distributions of other categorical variables, given these two categorical values. In particular, this method is suitable for datasets whose categorical variables are highly correlated. Results: The simulation results using several real life data showed that the proposed distance which considered relationships among the categorical variables generally yielded better clustering performance than the Hamming distance. In addition, as the number of correlated variables was increasing, the difference in the performance of the two clustering methods based on different distance measures became statistically more significant. Conclusion: This study revealed that the adoption of the relationship between categorical variables using our proposed method positively affected the results of cluster analysis.
Files in This Item
There are no files associated with this item.
Appears in
Collections
Dongguk Business School > Department of Business Administration > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Jung, Uk photo

Jung, Uk
Dongguk Business School (Department of Business Administration)
Read more

Altmetrics

Total Views & Downloads

BROWSE