JOURNAL ARTICLE

Incendio: Priority-Based Scheduling for Alleviating Cold Start in Serverless Computing

X.T. CaiQianlong SangChuang HuYili GongKun SuoXiaobo ZhouDazhao Cheng

Year: 2024 Journal:   IEEE Transactions on Computers Vol: 73 (7)Pages: 1780-1794   Publisher: Institute of Electrical and Electronics Engineers

Abstract

In serverless computing, cold start results in long response latency. Existing approaches strive to alleviate the issue by reducing the number of cold starts. However, our measurement based on real-world production traces shows that the minimum number of cold starts does not equate to the minimum response latency, and solely focusing on optimizing the number of cold starts will lead to sub-optimal performance. The root cause is that functions have different priorities in terms of latency benefits by transferring a cold start to a warm start. In this paper, we propose Incendio , a serverless computing framework exploiting priority-based scheduling to minimize the overall response latency from the perspective of cloud providers. We reveal the priority of a function is correlated to multiple factors and design a priority model based on Spearman's rank correlation coefficient. We integrate a hybrid Prophet-LightGBM prediction model to dynamically manage runtime pools, which enables the system to prewarm containers in advance and terminate containers at the appropriate time. Furthermore, to satisfy the low-cost and high-accuracy requirements in serverless computing, we propose a Clustered Reinforcement Learning-based function scheduling strategy. The evaluations show that Incendio speeds up the native system by 1.4×, and achieves 23% and 14.8% latency reductions compared to two state-of-the-art approaches.

Keywords:
Computer science Latency (audio) Scheduling (production processes) Response time Cloud computing Distributed computing Reinforcement learning Real-time computing Operating system Artificial intelligence Mathematical optimization Mathematics Telecommunications

Metrics

10
Cited By
15.28
FWCI (Field Weighted Citation Impact)
36
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Cloud Computing and Resource Management
Physical Sciences →  Computer Science →  Information Systems
IoT and Edge/Fog Computing
Physical Sciences →  Computer Science →  Computer Networks and Communications
Caching and Content Delivery
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.