JOURNAL ARTICLE

On Datacenter-Network-Aware Load Balancing in MapReduce

Abstract

MapReduce has emerged as a powerful tool for distributed and scalable processing of voluminous data. For skewed data input, load balancing is necessary among the MapReduce worker nodes to minimize the overall finishing time, which however can incur massive data movement in a data center network. In this paper, we for the first time examine this problem of data center-network-aware load balancing in the shuffle sub phase in MapReduce. Different from earlier studies that generally assume the network inside a data center has negligible delay and infinite capacity, we consider the traffic and bottlenecks in real data center networks by introducing the constraints on available network bandwidth, and demonstrate that the corresponding problem can be decomposed into two sub problems for network flow and load balancing, respectively. We show effective solutions to both of them, which together yield a complete solution towards near optimal data center-network-aware load balancing. A much simpler yet performance-wise comparable greedy algorithm is also developed for fast implementation in practice. The effectiveness of our solution has been demonstrated on synthetic and real public datasets.

Keywords:
Computer science Load balancing (electrical power) Data center Scalability Distributed computing Bandwidth (computing) Load management Greedy algorithm Flow network Computer network Algorithm Mathematical optimization Database

Metrics

3
Cited By
0.79
FWCI (Field Weighted Citation Impact)
23
Refs
0.80
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Cloud Computing and Resource Management
Physical Sciences →  Computer Science →  Information Systems
Software-Defined Networks and 5G
Physical Sciences →  Computer Science →  Computer Networks and Communications
Interconnection Networks and Systems
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.