ProKube: Proactive Kubernetes Orchestrator for Inference in Heterogeneous Edge Computing

B. Ali; Muhammed Golec; Sukhpal Singh Gill; Félix Cuadrado; Steve Uhlig

doi:10.1002/nem.2298

ScienceGate Book Chapters

JOURNAL ARTICLE

ProKube: Proactive Kubernetes Orchestrator for Inference in Heterogeneous Edge Computing

B. Ali Muhammed Golec Sukhpal Singh Gill Félix Cuadrado Steve Uhlig

Year: 2024 Journal: International Journal of Network Management Vol: 35 (1) Publisher: Wiley

DOI: 10.1002/nem.2298

Get Full-Text PDF Get Analytical Report

Abstract

ABSTRACT Deep neural network (DNN) and machine learning (ML) models/ inferences produce highly accurate results demanding enormous computational resources. The limited capacity of end‐user smart gadgets drives companies to exploit computational resources in an edge‐to‐cloud continuum and host applications at user‐facing locations with users requiring fast responses. Kubernetes hosted inferences with poor resource request estimation results in service level agreement (SLA) violation in terms of latency and below par performance with higher end‐to‐end (E2E) delays. Lifetime static resource provisioning either hurts user experience for under‐resource provisioning or incurs cost with over‐provisioning. Dynamic scaling offers to remedy delay by upscaling leading to additional cost whereas a simple migration to another location offering latency in SLA bounds can reduce delay and minimize cost. To address this cost and delay challenges for ML inferences in the inherent heterogeneous, resource‐constrained, and distributed edge environment, we propose ProKube, which is a proactive container scaling and migration orchestrator to dynamically adjust the resources and container locations with a fair balance between cost and delay. ProKube is developed in conjunction with Google Kubernetes Engine (GKE) enabling cross‐cluster migration and/ or dynamic scaling. It further supports the regular addition of freshly collected logs into scheduling decisions to handle unpredictable network behavior. Experiments conducted in heterogeneous edge settings show the efficacy of ProKube to its counterparts cost greedy (CG), latency greedy (LG), and GeKube (GK). ProKube offers 68%, 7%, and 64% SLA violation reduction to CG, LG, and GK, respectively, and it improves cost by 4.77 cores to LG and offers more cost of 3.94 to CG and GK.

Keywords:

Computer science Inference Enhanced Data Rates for GSM Evolution Distributed computing Artificial intelligence

Metrics

Cited By

10.69

FWCI (Field Weighted Citation Impact)

Refs

0.97

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Cloud Computing and Resource Management

Physical Sciences → Computer Science → Information Systems

Distributed and Parallel Computing Systems

Physical Sciences → Computer Science → Computer Networks and Communications

IoT and Edge/Fog Computing

Physical Sciences → Computer Science → Computer Networks and Communications

ProKube: Proactive Kubernetes Orchestrator for Inference in Heterogeneous Edge Computing

Abstract

Metrics

Citation History

Topics

Related Documents

A Kubernetes-Oriented Edge Network Orchestrator for Heterogeneous Environment

GAIKube: Generative AI-Based Proactive Kubernetes Container Orchestration Framework for Heterogeneous Edge Computing

Computation Placement Orchestrator for Mobile-Edge Computing in Heterogeneous Vehicular Networks

Deep Reinforcement Learning Edge Workload Orchestrator for Vehicular Edge Computing

Network-integrated edge computing orchestrator for application placement