Privacy-Preserving Synthetic Data Generation Method for IoT-Sensor Network IDS Using CTGAN
Citations

WEB OF SCIENCE

11
Citations

SCOPUS

13

초록

The increased usage of IoT networks brings about new privacy risks, especially when intrusion detection systems (IDSs) rely on large datasets for machine learning (ML) tasks and depend on third parties for storing and training the ML-based IDS. This study proposes a privacy-preserving synthetic data generation method using a conditional tabular generative adversarial network (CTGAN) aimed at maintaining the utility of IoT sensor network data for IDS while safeguarding privacy. We integrate differential privacy (DP) with CTGAN by employing controlled noise injection to mitigate privacy risks. The technique involves dynamic distribution adjustment and quantile matching to balance the utility–privacy tradeoff. The results indicate a significant improvement in data utility compared to the standard DP method, achieving a KS test score of 0.80 while minimizing privacy risks such as singling out, linkability, and inference attacks. This approach ensures that synthetic datasets can support intrusion detection without exposing sensitive information. © 2024 by the authors.

키워드

data utilitydeep learningdifferential privacygenerative adversarial networkInternet of thingsintrusion detection systems
제목
Privacy-Preserving Synthetic Data Generation Method for IoT-Sensor Network IDS Using CTGAN
저자
Alabdulwahab, SalehKim, Young-TakSon, Yunsik
DOI
10.3390/s24227389
발행일
2024-11
유형
Article
저널명
Sensors
24
22
페이지
1 ~ 18