ADSTREAM: Anomaly Detection in Large-Scale Data Streams Using Local Outlier Factor Based on Micro-Cluster
- Authors
- Seo, Sanghyun; Park, Seongchul; Hwang, Injea; Kim, Juntae
- Issue Date
- Oct-2017
- Publisher
- AMER SCIENTIFIC PUBLISHERS
- Keywords
- Anomaly Detection; Large-Scale Data Stream; Micro Cluster; Local Outlier Factor
- Citation
- ADVANCED SCIENCE LETTERS, v.23, no.10, pp 10204 - 10209
- Pages
- 6
- Indexed
- SCOPUS
- Journal Title
- ADVANCED SCIENCE LETTERS
- Volume
- 23
- Number
- 10
- Start Page
- 10204
- End Page
- 10209
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/14797
- DOI
- 10.1166/asl.2017.10419
- ISSN
- 1936-6612; 1936-7317
- Abstract
- Micro-cluster based clustering methods cluster large-scale data streams efficiently by using two components: an online phase and an offline phase. The online component creates micro-clusters from the input data stream, and the offline component performs the final clustering on the micro-clusters formed by the online component. However, these methods treat anomaly detection passively and therefore do not identify outliers explicitly. Most existing methodologies first cluster all data and then label whatever data was not clustered as outliers. Although typical micro-cluster based data stream clustering methods achieve excellent clustering quality, they are not suitable for anomaly detection, which must make clear which data are outliers. In this paper, we propose ADSTREAM, which applies the Local Outlier Factor to the centers of micro-clusters in the offline component to detect and specify outliers. In the experiments, we visualize the anomaly detection results of ADSTREAM, run micro-cluster based anomaly detection on the large-scale streams of the KDDCUP1999 dataset, and show that ADSTREAM's anomaly detection performance improves dramatically over existing micro-cluster based clustering methods. As a result, ADSTREAM performs anomaly detection efficiently while preserving the advantages of existing data stream clustering algorithms for real-time large-scale streams.
- Appears in
Collections - College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles
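
The two-phase idea described in the abstract (an online phase that forms micro-clusters, then a Local Outlier Factor computed on the micro-cluster centers in the offline phase) can be illustrated with a minimal sketch. This is not the authors' implementation: the nearest-center absorption rule, the `radius` and `k` parameters, the score threshold, and the synthetic data are all assumptions chosen for illustration, and the record gives no details on how ADSTREAM maintains or decays its micro-clusters.

```python
import numpy as np

def build_micro_clusters(stream, radius):
    """Simplified online phase (assumed rule, not from the paper): absorb each
    point into the nearest micro-cluster within `radius`, else open a new one."""
    sums, counts = [], []  # per micro-cluster: linear sum of points, point count
    for x in stream:
        x = np.asarray(x, dtype=float)
        if sums:
            centers = np.array(sums) / np.array(counts)[:, None]
            dists = np.linalg.norm(centers - x, axis=1)
            j = int(np.argmin(dists))
            if dists[j] <= radius:
                sums[j] = sums[j] + x
                counts[j] += 1
                continue
        sums.append(x.copy())
        counts.append(1)
    counts = np.array(counts)
    return np.array(sums) / counts[:, None], counts

def local_outlier_factor(points, k):
    """Standard LOF score for each row of `points` (assumes distinct points)."""
    n = len(points)
    d = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=2)
    np.fill_diagonal(d, np.inf)                 # a point is not its own neighbour
    nn = np.argsort(d, axis=1)[:, :k]           # indices of the k nearest neighbours
    k_dist = d[np.arange(n), nn[:, -1]]         # distance to the k-th neighbour
    # reach-dist(p, o) = max(k-distance(o), d(p, o)) for each neighbour o of p
    reach = np.maximum(k_dist[nn], d[np.arange(n)[:, None], nn])
    lrd = 1.0 / reach.mean(axis=1)              # local reachability density
    return lrd[nn].mean(axis=1) / lrd           # mean neighbour lrd over own lrd

# Synthetic stream (illustrative only): two dense blobs plus one isolated point.
rng = np.random.default_rng(0)
blob_a = rng.normal(loc=(0.0, 0.0), scale=1.0, size=(200, 2))
blob_b = rng.normal(loc=(5.0, 5.0), scale=1.0, size=(200, 2))
stream = np.vstack([blob_a, blob_b, [[20.0, -20.0]]])
stream = stream[rng.permutation(len(stream))]

centers, counts = build_micro_clusters(stream, radius=0.5)   # online phase
lof = local_outlier_factor(centers, k=5)                     # offline phase
threshold = 3.0                                              # hypothetical cut-off
print("micro-clusters:", len(centers))
print("flagged centers:", centers[lof > threshold])
```

Scoring only the micro-cluster centers keeps the offline step proportional to the number of micro-clusters rather than the number of stream points, which is the efficiency argument the abstract makes. A center with a score well above 1 sits in a much sparser neighbourhood than its neighbours do, so it is named as an outlier explicitly instead of being left over after clustering.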
