A maximum mean discrepancy-based autoencoder approach for dimension reduction with binary responses

Moon, Youngho; Lee, Yung-Seop

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

A maximum mean discrepancy-based autoencoder approach for dimension reduction with binary responses

Full metadata record

DC Field	Value	Language
dc.contributor.author	Moon, Youngho	-
dc.contributor.author	Lee, Yung-Seop	-
dc.date.accessioned	2025-08-05T02:30:17Z	-
dc.date.available	2025-08-05T02:30:17Z	-
dc.date.issued	2025-12	-
dc.identifier.issn	1226-3192	-
dc.identifier.issn	2005-2863	-
dc.identifier.uri	https://scholarworks.dongguk.edu/handle/sw.dongguk/58857	-
dc.description.abstract	The advent of the big data era has led to a significant increase in the utilization of high-dimensional data. The challenges arising from increased data dimension are often referred to as the "curse of dimensionality." Consequently, various dimension reduction (DR) techniques are being actively researched to address these challenges. In particular, applying DR techniques to high-dimensional datasets-such as those encountered in digital imaging, natural language processing, and genomics, often comprising hundreds of or thousands of variable-has gained considerable attention as a means to overcome the curse of dimensionality. This study proposes a novel DR technique utilizing a supervised autoencoder model for binary classification scenarios where the response variable is binary. The proposed method maps high-dimensional data into a lower-dimensional latent space learned by the autoencoder. Subsequently, it employs the Maximum Mean Discrepancy (MMD) loss function to enhance the linear separability between distinct classes within this latent representation. During the autoencoder's training, the MMD loss encourages samples from the same class to group closely together while simultaneously maximizing the distance between samples from different classes. Recognizing the critical importance of classification performance (e.g., distinguishing between defective and non-defective items) following dimension reduction in high-dimensional data with binary response variables, we conducted a comparative evaluation using seven distinct high-dimensional datasets. Experimental results demonstrate that the proposed model achieves superior performance compared to other established DR techniques, in terms of classification accuracy and F1-score.	-
dc.format.extent	27	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	한국통계학회	-
dc.title	A maximum mean discrepancy-based autoencoder approach for dimension reduction with binary responses	-
dc.type	Article	-
dc.publisher.location	대한민국	-
dc.identifier.doi	10.1007/s42952-025-00338-y	-
dc.identifier.scopusid	2-s2.0-105012184202	-
dc.identifier.wosid	001537773900001	-
dc.identifier.bibliographicCitation	Journal of the Korean Statistical Society, v.54, no.4, pp 1269 - 1295	-
dc.citation.title	Journal of the Korean Statistical Society	-
dc.citation.volume	54	-
dc.citation.number	4	-
dc.citation.startPage	1269	-
dc.citation.endPage	1295	-
dc.type.docType	Article	-
dc.identifier.kciid	ART003291905	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.description.journalRegisteredClass	kci	-
dc.relation.journalResearchArea	Mathematics	-
dc.relation.journalWebOfScienceCategory	Statistics & Probability	-
dc.subject.keywordAuthor	High dimension	-
dc.subject.keywordAuthor	Dimension reduction	-
dc.subject.keywordAuthor	Autoencoder	-
dc.subject.keywordAuthor	Maximum mean discrepancy loss function	-

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Natural Science > Department of Statistics > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Lee, Yung Seop photo

Lee, Yung Seop: College of Natural Science (Department of Statistics)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE