비선형 패턴을 지닌 범주형 자료의 군집분석을 위한 그래프 기반 거리 측도Graph-Based Distance Measure for Clustering Categorical Data with Nonlinear Patterns
- Other Titles
- Graph-Based Distance Measure for Clustering Categorical Data with Nonlinear Patterns
- Authors
- 이창기; 정욱
- Issue Date
- Jun-2019
- Publisher
- 한국신뢰성학회
- Keywords
- Geodesic Distance; Graph-Based Distance; Categorical Data; Gower Distance
- Citation
- 신뢰성 응용연구, v.19, no.2, pp 141 - 148
- Pages
- 8
- Indexed
- KCI
- Journal Title
- 신뢰성 응용연구
- Volume
- 19
- Number
- 2
- Start Page
- 141
- End Page
- 148
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/8012
- DOI
- 10.33162/JAR.2019.06.19.2.141
- ISSN
- 1738-9895
2733-8320
- Abstract
- Purpose: The purpose of this study is to suggest a more efficient distance measure taking into account the patterns of data for clustering categorical data Methods: The proposed categorical geodesic distance is calculated with three main steps: (1) The first step measures the Gower distance between two observations composed of categorical variables. (2) The second step is to represent the data as a mutual k-nearest neighbor graph. (3) The final step calculates the distance between two observations with the shortest path in the graph. The distance obtained from (3) is utilized for clustering categorical data. In particular, our proposed method is suitable for data with nonlinear patterns.
Results: Our experimental results using several real-life datasets reveal that the categorical data also has implicit topological structures and confirm that the distance considering implicit data patterns generally yields better clustering performance than existing Gower distance measure.
Conclusion: This study revealed that the adoption of the data patterns using our proposed distance measure positively affected the results of cluster analysis.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - Dongguk Business School > Department of Business Administration > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.