Today, cloud providers offer simplistic scaling policies that rely on thresholds, forcing tenants to have a priori knowledge of their workloads. Setting such thresholds is especially hard for memory-bound applications, where even a small decrease in available memory can have a dramatic, almost unbounded impact on performance; hence, sizing a machine's physical memory correctly is critical to both application performance and operating cost. We develop a new method for scaling memory-intensive workloads that needs no thresholds, making it worry-free for tenants and able to adapt even as workloads evolve. To determine a natural operating point for memory-intensive applications, our approach automatically analyzes an application's miss ratio curve (MRC) and models it as a hyperbola. Intuitively, a memory scaling policy should operate at the point where the curve flattens: that is, at its intersection with its latus rectum (LR). Our system uses a new approach to constructing and analyzing MRCs at run time, capturing memory references from a slice of any scalable application as it executes on standard virtual machines from any major cloud provider. We demonstrate this with multiple applications running on Amazon Web Services (AWS) and Microsoft Azure. Our implementation and evaluation show that, although the LR approach requires no tenant-set thresholds, it is effective at scaling memory-intensive workloads to save on operating costs while avoiding queuing, thrashing, or collapse: it increases throughput by 1.5× and reduces queuing delay by 2× in our evaluation.
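The LR operating point has a simple closed form once the MRC is modeled as a hyperbola. As a minimal sketch (not the paper's implementation), assume the simplest rectangular-hyperbola model m(s) = k/s with the coordinate axes as asymptotes, where s is memory size and m the miss ratio; the paper's model may include offset terms. For the curve x·y = k, the latus-rectum chord through the first-quadrant focus meets the curve at x = (1 + √2)·√k, which this sketch takes as the target memory size:

```python
import math

def fit_hyperbola(sizes, miss_ratios):
    """Least-squares fit of k in m(s) = k / s.
    Minimizing sum((m_i - k/s_i)^2) over k gives
    k = sum(m_i/s_i) / sum(1/s_i^2)."""
    num = sum(m / s for s, m in zip(sizes, miss_ratios))
    den = sum(1.0 / (s * s) for s in sizes)
    return num / den

def latus_rectum_memory(k):
    """For x*y = k, the latus rectum intersects the curve at
    x = (1 + sqrt(2)) * sqrt(k); return that x as the memory
    size where the MRC has flattened."""
    return (1.0 + math.sqrt(2.0)) * math.sqrt(k)

# Synthetic MRC samples drawn from m(s) = 0.5 / s (a toy workload).
sizes = [1.0, 2.0, 4.0, 8.0]
mrc = [0.5, 0.25, 0.125, 0.0625]
k = fit_hyperbola(sizes, mrc)            # recovers k = 0.5
target = latus_rectum_memory(k)          # ~1.71 memory units
```

Here `fit_hyperbola` and `latus_rectum_memory` are hypothetical names for illustration; the derivation of the intersection point (focus at (√(2k), √(2k)), semi-latus rectum of length √(2k)) follows from standard conic geometry.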
Wagdy Anis Aziz, Amir A. Ammar, John Soliman
María Goicoechea de Jorge, Antônio Carlos