Abstract

This article addresses the issue of efficient scaling of microservice architecture under variable load conditions and limited resources. Special attention is given to the use of automatic scaling mechanisms in Kubernetes, such as the Horizontal Pod Autoscaler and Vertical Pod Autoscaler, as well as the selection and interpretation of load metrics that form the basis for scaling decisions. It explores how different scaling approaches affect system performance and infrastructure operating costs. It also investigates how scaling strategies, the degree of automation, and load balancing configurations influence service stability, resource utilization, and the overall quality of distributed system operation.

Keywords:
Microservices Computer science Scaling Load balancing (electrical power) Operating system Geology Mathematics Cloud computing Geodesy

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
5
Refs
0.24
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Cloud Computing and Resource Management
Physical Sciences →  Computer Science →  Information Systems
Software System Performance and Reliability
Physical Sciences →  Computer Science →  Computer Networks and Communications
Software-Defined Networks and 5G
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.