JOURNAL ARTICLE

Efficient Data and Task Co-Scheduling for Scientific Workflow in Geo-Distributed Datacenters

Abstract

Scientific workflow usually needs to be performed in multiple collaborative datacenters for the requirement of accessing community-wide resources. However, the movements of initial input data and intermediate data across geo-distributed datacenters would hinder efficient execution of large-scale dataintensive scientific workflows. In this paper, a novel scheduling approach based on graph partition is proposed for the execution of data-intensive scientific workflow in geo-distributed datacenters, aiming at the optimization of the overall data transfer cost. Simulations show that our algorithm significantly reduces the overall geo-distributed data transfer and demonstrate its effectiveness.

Keywords:
Computer science Workflow Distributed computing Scheduling (production processes) Partition (number theory) Workflow technology Workflow management system Distributed database Workflow engine Distributed Computing Environment Database

Metrics

7
Cited By
4.46
FWCI (Field Weighted Citation Impact)
17
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Scientific Computing and Data Management
Social Sciences →  Decision Sciences →  Information Systems and Management
Distributed and Parallel Computing Systems
Physical Sciences →  Computer Science →  Computer Networks and Communications
Cloud Computing and Resource Management
Physical Sciences →  Computer Science →  Information Systems
© 2026 ScienceGate Book Chapters — All rights reserved.