JOURNAL ARTICLE

Auto-Scaling Cloud-Based Memory-Intensive Applications

Abstract

Today, Cloud providers offer simplistic scaling policies that rely on thresholds that force tenants to have a priori knowledge of their workloads. We develop a new method for scaling memory-intensive workloads that needs no thresholds. This makes it worry-free for tenants, and it adapts even as workloads evolve. This is especially hard for memory-bound applications where even a small decrease in the amount of memory available can have a dramatic, almost unbounded impact on performance. Hence, sizing a machine's physical memory correctly is critical to application performance and operating cost. To determine a natural threshold for memory-intensive applications, our approach automatically analyzes an application's miss ratio curve (MRC) and models it as a hyperbola. Intuitively, a memory scaling policy should operate at the point where the curve flattens: that is, at its intersection with its latus rectum (LR). Our system uses a new approach to constructing and analyzing MRCs at run time that captures memory references from a slice of any scalable application as it executes on standard virtual machines from any major Cloud provider. We demonstrate with multiple applications running on Amazon Web Services (AWS) and Microsoft Azure. Our implementation and evaluation show that, though the LR doesn't require tenants to set thresholds, it is effective in scaling memory-intensive workloads to save on operating costs while avoiding queuing, thrashing, or collapse. It increases throughput by 1.5× and reduces queuing delay by 2× in our evaluation.

Keywords:
Computer science Cloud computing Scalability Thrashing Queueing theory Throughput Operating system Scaling Distributed computing Parallel computing Computer network

Metrics

4
Cited By
0.85
FWCI (Field Weighted Citation Impact)
26
Refs
0.80
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Cloud Computing and Resource Management
Physical Sciences →  Computer Science →  Information Systems
IoT and Edge/Fog Computing
Physical Sciences →  Computer Science →  Computer Networks and Communications
Software System Performance and Reliability
Physical Sciences →  Computer Science →  Computer Networks and Communications

Related Documents

JOURNAL ARTICLE

Auto Scaling Solutions for Cloud Applications

Wagdy Anis AzizAmir A AmmarJohn Soliman

Journal:   International Journal of Simulation Systems Science & Technology Year: 2023
JOURNAL ARTICLE

An Autonomic Auto-scaling Controller for Cloud Based Applications

María Goicoechea de JorgeAntônio Carlos

Journal:   Greater South Information System Year: 2013
JOURNAL ARTICLE

An Autonomic Auto-scaling Controller for Cloud Based Applications

María Goicoechea de JorgeAntônio Carlos

Journal:   Greater South Information System Year: 2013
JOURNAL ARTICLE

An Autonomic Auto-scaling Controller for Cloud Based Applications

María Goicoechea de JorgeAntônio Carlos

Journal:   International Journal of Advanced Computer Science and Applications Year: 2013 Vol: 4 (9)
JOURNAL ARTICLE

Auto-Scaling Approach for Cloud based Mobile Learning Applications

Amani Nasser AlmutlaqYassine

Journal:   International Journal of Advanced Computer Science and Applications Year: 2019 Vol: 10 (1)
© 2026 ScienceGate Book Chapters — All rights reserved.