A Two-Stage Big Data Analytics Framework with Real World Applications Using Spark Machine Learning and Long Short-Term Memory Network
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Khan, Muhammad Ashfaq | - |
| dc.contributor.author | Karim, Md Rezaul | - |
| dc.contributor.author | Kim, Yangwoo | - |
| dc.date.accessioned | 2024-08-08T03:30:51Z | - |
| dc.date.available | 2024-08-08T03:30:51Z | - |
| dc.date.issued | 2018-10 | - |
| dc.identifier.issn | 2073-8994 | - |
| dc.identifier.uri | https://scholarworks.dongguk.edu/handle/sw.dongguk/17000 | - |
| dc.description.abstract | Every day we experience unprecedented data growth from numerous sources, which contribute to big data in terms of volume, velocity, and variability. These datasets in turn impose great challenges on analytics frameworks and computational resources, making it difficult to extract meaningful information in a timely manner. Thus, developing an efficient big data analytics framework to meet these challenges is an important research topic. To address them by exploiting non-linear relationships in very large and high-dimensional datasets, machine learning (ML) and deep learning (DL) algorithms are being used in analytics frameworks. Apache Spark has been in use as the fastest big data processing arsenal, which helps to solve iterative ML tasks using a distributed ML library called Spark MLlib. Considering real-world research problems, DL architectures such as Long Short-Term Memory (LSTM) are an effective approach to overcoming practical issues such as reduced accuracy, long-term sequence dependency, and vanishing and exploding gradients in conventional deep architectures. In this paper, we propose an efficient analytics framework, which is technically a progressive machine learning technique merged with Spark-based linear models, Multilayer Perceptron (MLP) and LSTM, using a two-stage cascade structure in order to enhance the predictive accuracy. Our proposed architecture enables us to organize big data analytics in a scalable and efficient way. To show the effectiveness of our framework, we applied the cascading structure to two different real-life datasets to solve a multiclass and a binary classification problem, respectively. Experimental results show that our analytical framework outperforms state-of-the-art approaches with a high level of classification accuracy. | - |
| dc.language | English | - |
| dc.language.iso | ENG | - |
| dc.publisher | MDPI | - |
| dc.title | A Two-Stage Big Data Analytics Framework with Real World Applications Using Spark Machine Learning and Long Short-Term Memory Network | - |
| dc.type | Article | - |
| dc.publisher.location | Switzerland | - |
| dc.identifier.doi | 10.3390/sym10100485 | - |
| dc.identifier.scopusid | 2-s2.0-85055729049 | - |
| dc.identifier.wosid | 000448561000065 | - |
| dc.identifier.bibliographicCitation | SYMMETRY-BASEL, v.10, no.10 | - |
| dc.citation.title | SYMMETRY-BASEL | - |
| dc.citation.volume | 10 | - |
| dc.citation.number | 10 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Science & Technology - Other Topics | - |
| dc.relation.journalWebOfScienceCategory | Multidisciplinary Sciences | - |
| dc.subject.keywordPlus | APACHE SPARK | - |
| dc.subject.keywordAuthor | big data | - |
| dc.subject.keywordAuthor | big data analytics | - |
| dc.subject.keywordAuthor | machine learning | - |
| dc.subject.keywordAuthor | deep learning | - |
| dc.subject.keywordAuthor | spark MLlib | - |
| dc.subject.keywordAuthor | multilayer perceptron | - |
| dc.subject.keywordAuthor | long short-term memory | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
