JOURNAL ARTICLE

Characterizing Co-Located Workloads in Alibaba Cloud Datacenters

Congfeng Jiang, Yitao Qiu, Weisong Shi, Zhefeng Ge, Jiwei Wang, Shenglei Chen, Christophe Cérin, Zujie Ren, Guoyao Xu, Jiangbin Lin

Year: 2020   Journal: IEEE Transactions on Cloud Computing   Vol: 10 (4)   Pages: 2381-2397   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Workload characteristics are vital for both data center operation and job scheduling in co-located data centers, where online services and batch jobs are deployed on the same production cluster. In this article, a comprehensive analysis is conducted on Alibaba's cluster-trace-v2018, collected from a production cluster of 4034 machines. The findings and insights are the following: (1) The workload on the production cluster exhibits a daily cyclical fluctuation in CPU and disk I/O utilization, and the memory system has become the performance bottleneck of a co-located cluster. (2) Batch jobs, including their tasks and derived instances, can be approximated by a Zipf distribution. However, all batch jobs with directed acyclic graph (DAG) dependencies suffer from co-location with online services, since the online services are given higher priority. (3) The resource usage of containers shows a cyclical fluctuation consistent with that of the whole cluster, while their memory usage remains approximately constant. (4) The number of batch jobs co-located with online services depends on the mispredictions per kilo instructions (MPKI) of the online services. To guarantee the QoS of online services, when their MPKI rises, the number of batch jobs co-located on the same machine should decrease.

Keywords:
Computer science, Bottleneck, Workload, Cloud computing, Job scheduler, Batch processing, Zipf's law, Scheduling (production processes), Operating system, Cluster (spacecraft), Idle, Distributed computing, Real-time computing, Embedded system

Metrics

Cited By: 39
FWCI (Field Weighted Citation Impact): 5.10
Refs: 36
Citation Normalized Percentile: 0.96

Topics

Cloud Computing and Resource Management
Physical Sciences →  Computer Science →  Information Systems
IoT and Edge/Fog Computing
Physical Sciences →  Computer Science →  Computer Networks and Communications
Advanced Data Storage Technologies
Physical Sciences →  Computer Science →  Computer Networks and Communications
