JOURNAL ARTICLE

DERP: A Deep Reinforcement Learning Cloud System for Elastic Resource Provisioning

Abstract

Modern large scale computer clusters benefit significantly from elasticity. Elasticity allows a cluster to dynamically allocate computer resources, based on the user's fluctuating workload demands. Many cloud providers use threshold-based approaches, which have been proven to be difficult to configure and optimise, while others use reinforcement learning and decision-tree approaches, which struggle when having to handle large multidimensional cluster states. In this work we use Deep Reinforcement learning techniques to achieve automatic elasticity. We use three different approaches of a Deep Reinforcement learning agent, called DERP (Deep Elastic Resource Provisioning), that takes as input the current multi-dimensional state of a cluster and manages to train and converge to the optimal elasticity behaviour after a finite amount of training steps. The system automatically decides and proceeds on requesting/releasing VM resources from the provider and orchestrating them inside a NoSQL cluster according to user-defined policies/rewards. We compare our agent to state-of-the-art, Reinforcement learning and decision-tree based, approaches in demanding simulation environments and show that it gains rewards up to 1.6 times better on its lifetime. We then test our approach in a real life cluster environment and show that the system resizes clusters in real-time and adapts its performance through a variety of demanding optimisation strategies, input and training loads.

Keywords:
Reinforcement learning Computer science Provisioning Elasticity (physics) Cloud computing Distributed computing Workload Artificial intelligence Machine learning Computer network Operating system

Metrics

58
Cited By
6.70
FWCI (Field Weighted Citation Impact)
26
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Cloud Computing and Resource Management
Physical Sciences →  Computer Science →  Information Systems
IoT and Edge/Fog Computing
Physical Sciences →  Computer Science →  Computer Networks and Communications
Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Deep Reinforcement Learning for Cloud Resource Provisioning

Ravikumar Perumallaplli

Journal:   SSRN Electronic Journal Year: 2025
BOOK-CHAPTER

A Deep Reinforcement Learning Based Resource Autonomic Provisioning Approach for Cloud Services

Zong QingXiangwei ZhengYi WeiHongfeng Sun

Lecture notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering Year: 2021 Pages: 132-153
JOURNAL ARTICLE

DRL-cloud: Deep reinforcement learning-based resource provisioning and task scheduling for cloud service providers

Mingxi ChengJi LiShahin Nazarian

Journal:   2018 23rd Asia and South Pacific Design Automation Conference (ASP-DAC) Year: 2018 Pages: 129-134
JOURNAL ARTICLE

DRL-cloud: deep reinforcement learning-based resource provisioning and task scheduling for cloud service providers

Mingxi ChengJi LiShahin Nazarian

Journal:   Asia and South Pacific Design Automation Conference Year: 2018 Pages: 129-134
© 2026 ScienceGate Book Chapters — All rights reserved.