JOURNAL ARTICLE

Fault-tolerant Cloud Workflow Scheduling with Uncertain Task Execution Time

Abstract

Resource performance fluctuation and resource failure become two main factors that affect task execution in cloud systems, especially for deadline-constrained workflow instances with precedence relationships among tasks. Lots of cloud workflow scheduling algorithms have been designed for fault tolerance or performance fluctuation separately while less work consider these two issues simultaneously. In this paper, an algorithm named FCWSU is proposed to fault-tolerant scheduling workflows with uncertain task execution time caused by resource performance fluctuation in clouds. A novel workflow scheduling architecture is designed in FCWSU to mitigate the delay propagation caused by either performance fluctuation or failure of VMs. A PB (Primary-Backup) model based scheduling algorithm is proposed for cloud resource failure tolerance. Experiment results show that FCWSU can provide better scheduling strategy for deadline-constrained workflows than corresponding competitors.

Keywords:
Computer science Workflow Distributed computing Cloud computing Scheduling (production processes) Backup Fault tolerance Dynamic priority scheduling Real-time computing Schedule Database Operating system Engineering

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
16
Refs
0.23
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Cloud Computing and Resource Management
Physical Sciences →  Computer Science →  Information Systems
Distributed and Parallel Computing Systems
Physical Sciences →  Computer Science →  Computer Networks and Communications
Scientific Computing and Data Management
Social Sciences →  Decision Sciences →  Information Systems and Management
© 2026 ScienceGate Book Chapters — All rights reserved.