Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks

Jingjing Cui; Yuanwei Liu; Arumugam Nallanathan

doi:10.1109/twc.2019.2935201

ScienceGate Book Chapters

JOURNAL ARTICLE

Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks

Jingjing Cui Yuanwei Liu Arumugam Nallanathan

Year: 2019 Journal: IEEE Transactions on Wireless Communications Vol: 19 (2)Pages: 729-743 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/twc.2019.2935201

Get Full-Text PDF Get Analytical Report

Abstract

Unmanned aerial vehicles (UAVs) are capable of serving as aerial base stations (BSs) for providing both cost-effective and on-demand wireless communications. This article investigates dynamic resource allocation of multiple UAVs enabled communication networks with the goal of maximizing long-term rewards. More particularly, each UAV communicates with a ground user by automatically selecting its communicating user, power level and subchannel without any information exchange among UAVs. To model the dynamics and uncertainty in environments, we formulate the long-term resource allocation problem as a stochastic game for maximizing the expected rewards, where each UAV becomes a learning agent and each resource allocation solution corresponds to an action taken by the UAVs. Afterwards, we develop a multi-agent reinforcement learning (MARL) framework that each agent discovers its best strategy according to its local observations using learning. More specifically, we propose an agent-independent method, for which all agents conduct a decision algorithm independently but share a common structure based on Q-learning. Finally, simulation results reveal that: 1) appropriate parameters for exploitation and exploration are capable of enhancing the performance of the proposed MARL based resource allocation algorithm; 2) the proposed MARL algorithm provides acceptable performance compared to the case with complete information exchanges among UAVs. By doing so, it strikes a good tradeoff between performance gains and information exchange overheads.

Keywords:

Reinforcement learning Computer science Resource allocation Base station Information exchange Resource management (computing) Resource (disambiguation) Distributed computing Q-learning Wireless Mathematical optimization Operations research Computer network Artificial intelligence Telecommunications Engineering

Metrics

519

Cited By

74.16

FWCI (Field Weighted Citation Impact)

Refs

1.00

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

UAV Applications and Optimization

Physical Sciences → Engineering → Aerospace Engineering

Distributed Control Multi-Agent Systems

Physical Sciences → Computer Science → Computer Networks and Communications

Energy Harvesting in Wireless Networks

Physical Sciences → Engineering → Electrical and Electronic Engineering

Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks

Abstract

Metrics

Citation History

Topics

Related Documents

Multi-agent reinforcement learning based resource allocation for vehicular networks

Resource Allocation in Vehicular Networks Based on Federated Multi-Agent Reinforcement Learning

Multi-Agent Deep Reinforcement Learning-Based Resource Allocation for Cognitive Radio Networks

Multi-Agent Deep Reinforcement Learning Based Distributed Resource Allocation

Multi-Agent Reinforcement Learning Based Dynamic Bandwidth Resource Allocation in Vehicle Edge Networks