Detailed Information

Cited 6 time in webofscience Cited 9 time in scopus
Metadata Downloads

Improvised methods for tackling big data stream mining challenges: case study of human activity recognition

Authors
Fong, SimonLiu, KexingCho, KyungeunWong, RaymondMohammed, SabahFiaidhi, Jinan
Issue Date
Oct-2016
Publisher
SPRINGER
Keywords
Data stream mining; Big data; Very fast decision tree; Resampling; Sensor data
Citation
JOURNAL OF SUPERCOMPUTING, v.72, no.10, pp 3927 - 3959
Pages
33
Indexed
SCI
SCIE
SCOPUS
Journal Title
JOURNAL OF SUPERCOMPUTING
Volume
72
Number
10
Start Page
3927
End Page
3959
URI
https://scholarworks.dongguk.edu/handle/sw.dongguk/15034
DOI
10.1007/s11227-016-1639-5
ISSN
0920-8542
1573-0484
Abstract
Big data stream is a new hype but a practical computational challenge founded on data streams that are prevalent in applications nowadays. It is quite well known that data streams that are originated and collected from monitoring sensors accumulate continuously to a very huge amount making traditional batch-based model induction algorithms infeasible for real-time data mining or just-in-time data analytics. In this position paper, following a new data stream mining methodology, namely stream-based holistic analytics and reasoning in parallel (SHARP), a list of data analytic challenges as well as improvised methods are looked into. In particular, two types of decision tree algorithms, batch-mode and incremental-mode, are put under test at sensor data that represents a typical big data stream. We investigate whether and to what extent of two improvised methods-outlier removal and balancing imbalanced class distributions-affect the prediction performance in big data stream mining. SHARP is founded on incremental learning which does not require all the training to be loaded into the memory. This important fundamental concept needs to be supported not only by the decision tree algorithms, but by the other improvised methods usually at the preprocessing stage as well. This paper sheds some light into this area which is often overlooked by data analysts when it comes to big data stream mining.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Cho, Kyung Eun photo

Cho, Kyung Eun
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE