This paper explores the application of multi-agent reinforcement learning (MARL) to the challenge of dynamic load balancing for AI workloads. Modern AI applications involve complex computational tasks that require efficient resource allocation across distributed systems, and traditional load balancing techniques struggle to adapt to the rapidly changing demands and heterogeneous nature of these workloads. We propose a novel MARL framework in which intelligent agents collaboratively learn to optimize resource allocation decisions in real time. Each agent manages a subset of resources and interacts with its environment to learn effective strategies for distributing tasks. We design a reward function that encourages efficient resource utilization, minimizes task completion times, and promotes fairness among agents. Our experiments demonstrate that the proposed MARL approach significantly outperforms conventional load balancing algorithms in overall system throughput, task latency, and resource utilization. We also analyze the emergent behavior of the agents and provide insights into the learned allocation strategies. The results highlight the potential of MARL for achieving dynamic and adaptive load balancing in complex AI-driven environments.
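The abstract describes a reward function balancing three objectives: resource utilization, task latency, and inter-agent fairness. A minimal sketch of one such composite reward is shown below; the weights, the latency normalization, and the use of Jain's fairness index are illustrative assumptions, not the authors' actual formulation.

```python
# Hypothetical per-agent reward combining the three objectives named in the
# abstract. Weights and scales are placeholder assumptions for illustration.

def jain_fairness(loads):
    """Jain's fairness index: equals 1.0 when all agents carry equal load."""
    total = sum(loads)
    if not loads or total == 0:
        return 1.0
    return total ** 2 / (len(loads) * sum(x * x for x in loads))

def agent_reward(utilization, mean_latency, loads,
                 w_util=1.0, w_lat=0.5, w_fair=0.5, latency_scale=100.0):
    """Reward = utilization bonus - latency penalty + fairness bonus.

    utilization  -- fraction of this agent's resources in use, in [0, 1]
    mean_latency -- average task completion time (same units as latency_scale)
    loads        -- current load on every agent, used for the fairness term
    """
    return (w_util * utilization
            - w_lat * (mean_latency / latency_scale)
            + w_fair * jain_fairness(loads))

# A well-utilized, fast, balanced system scores higher than an
# underutilized, slow, imbalanced one.
r_good = agent_reward(0.9, 20.0, [10, 10, 10])
r_bad = agent_reward(0.5, 80.0, [28, 1, 1])
```

A shaped scalar reward like this lets each agent be trained with standard RL algorithms while the fairness term couples its incentives to the load carried by the other agents.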
S. Wilson Prakash, S. Usharani, R. Rajesh, K. Varada Rajkumar