JOURNAL ARTICLE

Transformation-Based Streaming Workflow Allocation on Geo-Distributed Datacenters for Streaming Big Data Processing

Wuhui ChenIncheon PaikPatrick C. K. Hung

Year: 2016 Journal:   IEEE Transactions on Services Computing Vol: 12 (4)Pages: 654-668   Publisher: Institute of Electrical and Electronics Engineers

Abstract

The cost-minimization problem for streaming workflow (SW) has already become increasingly important and even critical in stream big data processing, particularly for geographically distributed datacenters, because of its huge demand on computing and communicating resources. Existing virtual machine (VM) allocation algorithms in cloud computing have been widely applied to batch-processing models; however, none of them can be successfully applied to SW because: 1) they failed to adapt the continuous execution characteristic of SW; and 2) most of them are all based on the assumption that the price of traffic and VMs among datacenters are uniform. In this paper, we propose a transformation-based SW allocation algorithm with the goal of cost-minimization for stream big data processing in geographically distributed datacenters, considering the characteristics of SW and price heterogeneity among geographically distributed datacenters. We first propose a cost-aware workflow transformation framework based on eight well-designed and verified transformation rules for cost reduction to adapt the continuous execution characteristic of SW. We then formulate the joint VM-traffic optimization problem and show that it is NP-hard. To produce the optimal solution in polynomial time, we then transform the SW allocation problem into the minimum-cost maximum-flow problem, considering both traffic and VMs price heterogeneity. Finally, our experimental results validate the high cost efficiency of our approach with lower computing and communicating costs by optimizing the workflow specification and joint VM-traffic cost optimization.

Keywords:
Computer science Workflow Cloud computing Distributed computing Stream processing Server Minification Big data Optimization problem Computer network Algorithm Database Data mining Operating system

Metrics

18
Cited By
4.65
FWCI (Field Weighted Citation Impact)
43
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Cloud Computing and Resource Management
Physical Sciences →  Computer Science →  Information Systems
Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
IoT and Edge/Fog Computing
Physical Sciences →  Computer Science →  Computer Networks and Communications

Related Documents

JOURNAL ARTICLE

Cost-Aware Streaming Workflow Allocation on Geo-Distributed Data Centers

Wuhui ChenIncheon PaikZhenni Li

Journal:   IEEE Transactions on Computers Year: 2016 Pages: 1-1
DISSERTATION

Streaming workflow transformation

Tjalling van der Wal

University:   Australian Dental Journal Year: 2011 Vol: 12 (2)Pages: 152-9
JOURNAL ARTICLE

Location based Continuous Query Processing over Geo-streaming Data

K. V. Metre

Journal:   Türk bilgisayar ve matematik eğitimi dergisi Year: 2021 Vol: 12 (1S)Pages: 106-114
JOURNAL ARTICLE

Distributed Data Center Bandwidth Allocation for Cloud-Based Streaming

Fanxin KongXingjian LuXue Liu

Journal:   IEEE Transactions on Sustainable Computing Year: 2017 Vol: 4 (2)Pages: 263-276
© 2026 ScienceGate Book Chapters — All rights reserved.