A Study on Detection of Malicious Behavior Based on Host Process Data Using Machine Learning

Han, Ryeobin; Kim, Kookjin; Choi, Byunghun; Jeong, Youngsik

Cited 7 times in Web of Science

Cited 11 times in Scopus

Open access

Authors: Han, Ryeobin; Kim, Kookjin; Choi, Byunghun; Jeong, Youngsik

Issue Date: Apr-2023

Publisher: MDPI

Keywords: behavior detection; anomaly detection; cyber security; machine learning

Citation: Applied Sciences, v.13, no.7, pp. 1-17

Pages: 17

Indexed: SCIE; SCOPUS

Journal Title: Applied Sciences

Volume: 13

Number: 7

Start Page: 1

End Page: 17

URI: https://scholarworks.dongguk.edu/handle/sw.dongguk/18686

DOI: 10.3390/app13074097

ISSN: 2076-3417

Abstract: With the rapid increase in the number of cyber-attacks, detecting and preventing malicious behavior has become more important than ever before. In this study, we propose a method for detecting and classifying malicious behavior in host process data using machine learning algorithms. One of the challenges in this study is dealing with high-dimensional and imbalanced data. To address this, we first preprocessed the data using Principal Component Analysis (PCA) and Uniform Manifold Approximation and Projection (UMAP) to reduce the dimensions of the data and visualize the distribution. We then used the Adaptive Synthetic (ADASYN) and Synthetic Minority Over-sampling Technique (SMOTE) to handle the imbalanced data. We trained and evaluated the performance of the models using various machine learning algorithms, such as K-Nearest Neighbor, Naive Bayes, Random Forest, Autoencoder, and Memory-Augmented Deep Autoencoder (MemAE). Our results show that the preprocessed datasets using both ADASYN and SMOTE significantly improved the performance of all models, achieving higher precision, recall, and F1-Score values. Notably, the best performance was obtained when using the preprocessed dataset (SMOTE) with the MemAE model, yielding an F1-Score of 1.00. The evaluation was also conducted by measuring the Area Under the Receiver Operating Characteristic Curve (AUROC), which showed that all models performed well with an AUROC of over 90%. Our proposed method provides a promising approach for detecting and classifying malicious behavior in host process data using machine learning algorithms, which can be used in various fields such as anomaly detection and medical diagnosis.
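
The oversampling step named in the abstract can be illustrated with a minimal sketch of the SMOTE idea: new minority-class rows are created by interpolating between an existing minority row and one of its nearest minority neighbours. This is not the authors' code. The abstract does not say which implementation they used, and the function name `smote`, its parameters, and the toy data below are assumptions made only for illustration. This variant picks the base row at random for each synthetic sample; in practice a maintained library implementation would normally be preferred.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Generate n_new synthetic rows by interpolating between minority rows.

    minority: list of equal-length feature vectors (lists of floats).
    k: number of nearest minority neighbours considered per base row.
    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority rows by Euclidean distance to the base row.
        others = [j for j in range(len(minority)) if j != i]
        others.sort(key=lambda j: math.dist(base, minority[j]))
        neighbour = minority[rng.choice(others[:k])]
        # Pick a random point on the segment between base and neighbour.
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Toy data (made up): 2 features, 20 benign rows versus 4 malicious rows.
data_rng = random.Random(1)
benign = [[data_rng.gauss(0.0, 1.0), data_rng.gauss(0.0, 1.0)] for _ in range(20)]
malicious = [[5.0, 5.0], [5.5, 4.5], [6.0, 5.2], [4.8, 5.9]]

# Create enough synthetic malicious rows to match the benign class size.
new_rows = smote(malicious, n_new=len(benign) - len(malicious), k=3)
balanced_malicious = malicious + new_rows
print(len(benign), len(balanced_malicious))
```

Because every synthetic row lies on a segment between two real minority rows, the oversampled class stays inside the region the original minority data already occupies. ADASYN, the other technique the abstract names, follows the same interpolation idea but generates more synthetic rows around minority samples that are harder to learn.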

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

Related Researcher

Jeong, Young Sik: College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
