Cited 2 time in
Context-Based Geodesic Dissimilarity Measure for Clustering Categorical Data
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Lee, Changki | - |
| dc.contributor.author | Jung, Uk | - |
| dc.date.accessioned | 2023-04-27T16:40:23Z | - |
| dc.date.available | 2023-04-27T16:40:23Z | - |
| dc.date.issued | 2021-09 | - |
| dc.identifier.issn | 2076-3417 | - |
| dc.identifier.issn | 2076-3417 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/4526 | - |
| dc.description.abstract | Measuring the dissimilarity between two observations is the basis of many data mining and machine learning algorithms, and its effectiveness has a significant impact on learning outcomes. The dissimilarity or distance computation has been a manageable problem for continuous data because many numerical operations can be successfully applied. However, unlike continuous data, defining a dissimilarity between pairs of observations with categorical variables is not straightforward. This study proposes a new method to measure the dissimilarity between two categorical observations, called a context-based geodesic dissimilarity measure, for the categorical data clustering problem. The proposed method considers the relationships between categorical variables and discovers the implicit topological structures in categorical data. In other words, it can effectively reflect the nonlinear patterns of arbitrarily shaped categorical data clusters. Our experimental results confirm that the proposed measure that considers both nonlinear data patterns and relationships among the categorical variables yields better clustering performance than other distance measures. | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | MDPI | - |
| dc.title | Context-Based Geodesic Dissimilarity Measure for Clustering Categorical Data | - |
| dc.type | Article | - |
| dc.publisher.location | 스위스 | - |
| dc.identifier.doi | 10.3390/app11188416 | - |
| dc.identifier.scopusid | 2-s2.0-85114662298 | - |
| dc.identifier.wosid | 000700242300001 | - |
| dc.identifier.bibliographicCitation | APPLIED SCIENCES-BASEL, v.11, no.18 | - |
| dc.citation.title | APPLIED SCIENCES-BASEL | - |
| dc.citation.volume | 11 | - |
| dc.citation.number | 18 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Chemistry | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Materials Science | - |
| dc.relation.journalResearchArea | Physics | - |
| dc.relation.journalWebOfScienceCategory | Chemistry, Multidisciplinary | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Multidisciplinary | - |
| dc.relation.journalWebOfScienceCategory | Materials Science, Multidisciplinary | - |
| dc.relation.journalWebOfScienceCategory | Physics, Applied | - |
| dc.subject.keywordPlus | DIVERGENCE | - |
| dc.subject.keywordPlus | ALGORITHM | - |
| dc.subject.keywordAuthor | geodesic distance | - |
| dc.subject.keywordAuthor | categorical data | - |
| dc.subject.keywordAuthor | mutual k-nearest neighbor graph | - |
| dc.subject.keywordAuthor | association-based dissimilarity | - |
| dc.subject.keywordAuthor | Gower distance | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114
Copyright(c) 2023 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
