Traditional rule-based auto-scaling often struggles with complex cloud workloads. This study develops a reinforcement-learning-based auto-scaling agent that dynamically adjusts compute resources based on predicted demand. Experiments show improved cost efficiency and reduced latency under unpredictable load patterns. The results highlight the potential of AI-based scaling for high-traffic cloud systems.
Sudip PoudelKushal Sharma MarasiniLokesh BhattDaya Sagar Baral
Spyridon ChouliarasStelios Sotiriadis
Wagdy Anis AzizAmir A AmmarJohn Soliman
Uttom AkashPartha Protim PaulAhsan Habib