Detailed Information

Cited 1 time in webofscience Cited 1 time in scopus
Metadata Downloads

Performance Modeling and Analysis of a Hadoop Cluster for Efficient Big Data Processing

Authors
Lim, JongBeomAhnh, Jong-SukLee, Kang-Woo
Issue Date
Sep-2016
Publisher
AMER SCIENTIFIC PUBLISHERS
Keywords
MapReduce; Hadoop; Performance Model; Big Data
Citation
ADVANCED SCIENCE LETTERS, v.22, no.9, pp 2314 - 2319
Pages
6
Indexed
SCOPUS
Journal Title
ADVANCED SCIENCE LETTERS
Volume
22
Number
9
Start Page
2314
End Page
2319
URI
https://scholarworks.dongguk.edu/handle/sw.dongguk/18025
DOI
10.1166/asl.2016.7813
ISSN
1936-6612
1936-7317
Abstract
Although Apache Hadoop, an open-source implementation of the MapReduce programming model in Java, has become a popular big data framework, it is important to understand the challenges of using Hadoop for varying input data sizes, and how efficient is a Hadoop cluster with configurations. In this regard, there is a need to understand the impact of Hadoop implementation for data-parallel programming model on the performance of big data processing. In this paper, we design a performance model of a Hadoop cluster with consideration of the number of Map and Reduce tasks. Because each Hadoop cluster has its own characteristics and system parameters, it is not enough to use default settings of Hadoop configurations Furthermore, we present performance analysis based on real-world environments using cloud computing. With various performance evaluations, we identified a performance tradeoff between the number of Map and Reduce tasks and processing times of a job. Based on our observations for big data jobs with varying input data sizes, we formulated a performance model for a Hadoop cluster not only in a microscopic view but also in a macroscopic view. Our performance model for Hadoop clusters help estimate the processing rate and the average processing time for given input dataset sizes, and choose suitable configurations, which influence overall Hadoop clusters' performance largely.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Kang Woo photo

Lee, Kang Woo
College of Advanced Convergence Engineering (Department of Computer Science and Artificial Intelligence)
Read more

Altmetrics

Total Views & Downloads

BROWSE