JOURNAL ARTICLE

Heterogeneous Environment Aware Streaming Graph Partitioning

Ning XuBin CuiLei ChenZi HuangYingxia Shao

Year: 2014 Journal:   IEEE Transactions on Knowledge and Data Engineering Vol: 27 (6)Pages: 1560-1572   Publisher: IEEE Computer Society

Abstract

With the increasing availability of graph data and widely adopted cloud computing paradigm, graph partitioning has become an efficient pre-processing technique to balance the computing workload and cope with the large scale of input data. Since the cost of partitioning the entire graph is strictly prohibitive, there are some recent tentative works towards streaming graph partitioning which run faster, are easily parallelized, and can be incrementally updated. Most of the existing works on streaming partitioning assume that worker nodes within a cluster are homogeneous in nature. Unfortunately, this assumption does not always hold. Experiments show that these homogeneous algorithms suffer a significant performance degradation when running at heterogeneous environment. In this paper, we propose a novel adaptive streaming graph partitioning approach to cope with heterogeneous environment. We first formally model the heterogeneous computing environment with the consideration of the unbalance of computing ability (e.g., the CPU frequency) and communication ability (e.g., the network bandwidth) for each node. Based on this model, we propose a new graph partitioning objective function that aims to minimize the total execution time of the graph-processing job. We then explore some simple yet effective streaming algorithms for this objective function that can achieve balanced and efficient partitioning result. Extensive experiments are conducted on a moderate sized computing cluster with real-world web and social network graphs. The results demonstrate that the proposed approach achieves significant improvement compared with the state-of-the-art solutions.

Keywords:
Computer science Graph partition Cloud computing Distributed computing Graph Workload Homogeneous Parallel computing Theoretical computer science Operating system

Metrics

44
Cited By
3.86
FWCI (Field Weighted Citation Impact)
35
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Graph Theory and Algorithms
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Interconnection Networks and Systems
Physical Sciences →  Computer Science →  Computer Networks and Communications
Parallel Computing and Optimization Techniques
Physical Sciences →  Computer Science →  Hardware and Architecture

Related Documents

JOURNAL ARTICLE

Streaming graph partitioning

Zainab AbbasVasiliki KalavriParis CarboneVladimir Vlassov

Journal:   Proceedings of the VLDB Endowment Year: 2018 Vol: 11 (11)Pages: 1590-1603
JOURNAL ARTICLE

Buffered Streaming Graph Partitioning

Marcelo Fonseca FarajChristian Schulz

Journal:   ACM Journal of Experimental Algorithmics Year: 2022 Vol: 27 Pages: 1-26
© 2026 ScienceGate Book Chapters — All rights reserved.