Resource Allocation in UAV-D2D Networks: A Scalable Heterogeneous Multi-Agent Deep Reinforcement Learning Approach

Huayuan Wang; Hui Li; Xiaoliang Wang; Shilin Xia; Tao Liu; Ruonan Wang

doi:10.3390/electronics13224401

JOURNAL ARTICLE

Year: 2024; Journal: Electronics; Vol: 13 (22); Article: 4401; Publisher: Multidisciplinary Digital Publishing Institute

Abstract

In unmanned aerial vehicle (UAV)-assisted device-to-device (D2D) caching networks, unpredictable content demands and changing user positions introduce uncertainty that often makes traditional optimization methods impractical. Multi-agent deep reinforcement learning (MADRL) is well suited to optimizing decisions in multi-agent systems and serves as an effective and practical alternative. However, its application in large-scale dynamic environments is severely limited by the curse of dimensionality and communication overhead. To address these limitations, we develop a scalable heterogeneous multi-agent mean-field actor-critic (SH-MAMFAC) framework. The framework treats ground users (GUs) and UAVs as distinct agents and designs cooperative rewards that convert the resource allocation problem into a fully cooperative game, enhancing global network performance. We also implement a mixed-action mapping strategy to handle discrete and continuous action spaces. A mean-field MADRL framework is introduced to reduce the training load of individual agents while enhancing the total cache hit probability (CHP). Simulation results show that our algorithm improves CHP and reduces transmission delay. A comparative analysis with existing mainstream deep reinforcement learning (DRL) algorithms shows that SH-MAMFAC significantly reduces training time and maintains high CHP as the number of GUs grows. Additionally, a comparison with SH-MAMFAC variants that omit trajectory optimization or power control shows that the proposed joint design significantly reduces transmission delay.
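The abstract does not give the SH-MAMFAC equations, so the following is only a minimal sketch of the general mean-field idea it refers to, not the authors' implementation. It assumes discrete one-hot actions and a critic that sees the agent's own state, its own action, and the average action of its neighbors; the function names (`one_hot`, `mean_field_action`, `critic_input`) and all dimensions are hypothetical. The point it illustrates is why the approach scales: the critic input length stays fixed as the number of agents grows.

```python
import numpy as np


def one_hot(indices, n_actions):
    """Encode integer action indices as one-hot rows (assumed action format)."""
    return np.eye(n_actions)[np.asarray(indices)]


def mean_field_action(actions, agent, neighbors):
    """Average the one-hot actions of an agent's neighbors, excluding itself.

    actions   : array of shape (n_agents, n_actions)
    agent     : index of the agent whose critic is being evaluated
    neighbors : iterable of agent indices treated as its neighborhood
    """
    others = [j for j in neighbors if j != agent]
    if not others:
        # No neighbors: fall back to an all-zero mean action.
        return np.zeros(actions.shape[1])
    return actions[others].mean(axis=0)


def critic_input(state, own_action, mean_action):
    """Build the critic input; its length does not depend on the agent count."""
    return np.concatenate([state, own_action, mean_action])


if __name__ == "__main__":
    n_actions = 3
    state = np.zeros(5)  # placeholder local observation (hypothetical size)
    for n_agents in (4, 40):
        acts = one_hot(np.arange(n_agents) % n_actions, n_actions)
        mean_a = mean_field_action(acts, 0, range(n_agents))
        x = critic_input(state, acts[0], mean_a)
        print(n_agents, "agents -> critic input length", x.shape[0])
```

A critic that consumed the full joint action would need an input that grows with every added GU or UAV; replacing the joint action with a neighborhood average is what keeps the per-agent training load bounded in this sketch.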

Keywords:

Reinforcement learning; Scalability; Computer science; Resource allocation; Distributed computing; Artificial intelligence; Resource (disambiguation); Computer network; Database

Metrics

FWCI (Field Weighted Citation Impact): 2.64

Citation Normalized Percentile: 0.88

Topics

UAV Applications and Optimization

Physical Sciences → Engineering → Aerospace Engineering

Distributed Control Multi-Agent Systems

Physical Sciences → Computer Science → Computer Networks and Communications

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Deep Multi-Agent Reinforcement Learning for Resource Allocation in D2D Communication Underlaying Cellular Networks

Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks

Multi-Agent Power and Resource Allocation for D2D Communications: A Deep Reinforcement Learning Approach

Optimal D2D Resource Allocation in Heterogeneous Cellular Networks by Decentralized Multi-Agent Deep Q-Learning

Resource Allocation for Multi-UAV Assisted IoT Networks: A Deep Reinforcement Learning Approach