Open Access

ARTICLE

Run-Time Dynamic Resource Adjustment for Mitigating Skew in MapReduce

Zhihong Liu1, Shuo Zhang2,*, Yaping Liu2, Xiangke Wang1, Dong Yin1
1 College of Intelligence and Technology, National University of Defense Technology, Changsha, 410073, China
2 Cyberspace Institute of Advanced Technology, Guangzhou University, Guangzhou, 510006, China
* Corresponding Author: Shuo Zhang. Email:
(This article belongs to the Special Issue: Intelligent Models for Security and Resilience in Cyber Physical Systems)

Computer Modeling in Engineering & Sciences 2021, 126(2), 771-790. https://doi.org/10.32604/cmes.2021.013244

Received 30 July 2020; Accepted 15 September 2020; Issue published 21 January 2021

Abstract

MapReduce is a widely used programming model for large-scale data processing. However, it still suffers from the skew problem, in which load is imbalanced among tasks. Skew can cause a small number of tasks to take much longer than the others, thereby prolonging the total job completion time. Existing solutions commonly predict the loads of tasks and then rebalance the load among them, but they often incur high performance overhead from the load prediction and rebalancing. Moreover, existing solutions target the partitioning skew of reduce tasks and cannot mitigate the computational skew of map tasks. Accordingly, in this paper, we present DynamicAdjust, a run-time dynamic resource adjustment technique for mitigating skew. Rather than rebalancing the load among tasks, DynamicAdjust monitors the run-time execution of tasks and dynamically increases the resources of those tasks that require more computation. In so doing, DynamicAdjust not only eliminates the overhead incurred by load prediction and rebalancing, but also mitigates both the partitioning skew and the computational skew. Experiments are conducted on a real 21-node cluster using real-world datasets. The results show that DynamicAdjust can mitigate the negative impact of skew and shorten the job completion time by up to 40.85%.
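The idea of adding resources to skewed tasks instead of rebalancing their load can be illustrated with a toy model. The sketch below is not the DynamicAdjust implementation. It assumes that a task's run time equals its load divided by its allocated resource units (ideal linear speedup), and the function names `job_completion_time` and `adjust_resources`, along with the proportional allocation policy, are hypothetical choices made only for illustration.

```python
# Toy model only: NOT the DynamicAdjust algorithm from the paper.
# Assumption: run time of a task = load / allocated resource units.

def job_completion_time(loads, resources):
    """A job finishes when its slowest task finishes."""
    return max(load / res for load, res in zip(loads, resources))


def adjust_resources(loads, base=1.0, spare=0.0):
    """Hypothetical policy: give spare resource units to tasks in
    proportion to how far their load exceeds the mean load."""
    mean = sum(loads) / len(loads)
    excess = [max(0.0, load - mean) for load in loads]
    total_excess = sum(excess)
    if total_excess == 0:
        # No skew: every task keeps its base allocation.
        return [base] * len(loads)
    return [base + spare * e / total_excess for e in excess]


# Three ordinary tasks and one skewed task (arbitrary load units).
loads = [10, 10, 10, 40]

static = [1.0] * len(loads)                  # equal, fixed allocation
adjusted = adjust_resources(loads, spare=1.0)  # one spare unit available

print(job_completion_time(loads, static))    # slowest task dominates
print(job_completion_time(loads, adjusted))  # skewed task gets extra resources
```

In this toy setting the load of each task is left untouched and only the resource allocation changes, which mirrors the contrast the abstract draws with load-rebalancing approaches. Real tasks rarely speed up linearly with added resources, so the numbers here only indicate the direction of the effect.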

Keywords

MapReduce; task scheduling; resource allocation; data skew; big data

Cite This Article

Liu, Z., Zhang, S., Liu, Y., Wang, X., Yin, D. (2021). Run-Time Dynamic Resource Adjustment for Mitigating Skew in MapReduce. CMES-Computer Modeling in Engineering & Sciences, 126(2), 771–790.



This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.