JOURNAL ARTICLE

Optimizing Latency-Sensitive AI Applications Through Edge-Cloud Collaboration

Jinsong WuHongbo WangKun QianEnmiao Feng

Year: 2023 Journal:   Journal of Advanced Computing Systems Vol: 3 (3)Pages: 19-33

Abstract

This paper presents a novel framework for optimizing latency-sensitive AI applications through intelligent edge-cloud collaboration. The proposed approach addresses critical challenges in deploying computationally intensive AI workloads across distributed computing environments while meeting stringent timing requirements. The framework introduces an adaptive workload partitioning mechanism that dynamically distributes computational tasks based on application-specific latency requirements, resource availability, and network conditions. A comprehensive resource allocation strategy optimizes utilization across the computing continuum through specialized scheduling algorithms that prioritize time-sensitive operations. Communication protocol optimizations reduce data transfer overhead through context-aware compression techniques and adaptive packet sizing. Experimental evaluation conducted across heterogeneous computing environments demonstrates significant performance improvements, achieving latency reductions of 50-62% compared to baseline approaches. Resource utilization patterns show increased edge resource efficiency (83.4%) while reducing cloud resource consumption (31.1%). Energy efficiency metrics indicate substantial improvements across application categories, with energy-per-transaction reductions ranging from 50.0% to 60.6%. The framework maintains performance standards under challenging operational conditions, including network congestion and limited resource availability, validating its applicability for real-world deployment scenarios. The results demonstrate that intelligent edge-cloud collaboration can significantly enhance performance for latency-sensitive AI applications while improving overall system efficiency.

Keywords:
Cloud computing Computer science Enhanced Data Rates for GSM Evolution Latency (audio) Operating system Artificial intelligence Telecommunications

Metrics

2
Cited By
1.24
FWCI (Field Weighted Citation Impact)
0
Refs
0.84
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Recommender Systems and Techniques
Physical Sciences →  Computer Science →  Information Systems
Innovation in Digital Healthcare Systems
Health Sciences →  Health Professions →  Health Information Management
IoT and Edge/Fog Computing
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.