JOURNAL ARTICLE

Multi-Agent Reinforcement Learning Aided Resource Allocation With SARSA in UAV Networks

Abstract

This article rigorously explores the self-directed allocation of resources in communication networks powered by multiple Unmanned Aerial Vehicles (UAVs), aimed at optimizing long-term gains. To model the complexities of dynamic and unpredictable environments, we formulate the challenge of long-term resource allocation as a stochastic game. Our primary objective is to maximize expected rewards, with each UAV functioning as a learning agent and each resource allocation solution corresponding to an action executed by the UAVs within a Multi-Agent Reinforcement Learning (MARL) framework. Moreover, we introduce an agent-agnostic approach, where all agents independently implement a decision algorithm, yet maintain a shared structure based on SARSA (State-Action- Reward-State-Action). Our simulations demonstrate that the proposed algorithm exhibits a rarity that is commendable, particularly when compared to scenarios that demand exhaustive information exchange among the UAVs.

Keywords:
Reinforcement learning Computer science Resource allocation Multi-agent system Resource management (computing) Resource (disambiguation) Artificial intelligence Distributed computing Computer network

Metrics

3
Cited By
1.56
FWCI (Field Weighted Citation Impact)
13
Refs
0.86
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

UAV Applications and Optimization
Physical Sciences →  Engineering →  Aerospace Engineering
Distributed Control Multi-Agent Systems
Physical Sciences →  Computer Science →  Computer Networks and Communications
Cognitive Radio Networks and Spectrum Sensing
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.