Deep Reinforcement Learning-Based Resource Allocation in Cooperative UAV-Assisted Wireless Networks

Phuong Luong; François Gagnon; Le‐Nam Tran; Fabrice Labeau

doi:10.1109/twc.2021.3086503

ScienceGate Book Chapters

JOURNAL ARTICLE

Deep Reinforcement Learning-Based Resource Allocation in Cooperative UAV-Assisted Wireless Networks

Phuong Luong François Gagnon Le‐Nam Tran Fabrice Labeau

Year: 2021 Journal: IEEE Transactions on Wireless Communications Vol: 20 (11)Pages: 7610-7625 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/twc.2021.3086503

Get Full-Text PDF Get Analytical Report

Abstract

We consider the downlink of an unmanned aerial vehicle (UAV) assisted cellular network consisting of multiple cooperative UAVs, whose operations are coordinated by a central ground controller using wireless fronthaul links, to serve multiple ground user equipments (UEs). A problem of jointly designing UAVs’ positions, transmit beamforming, as well as UAV-UE association is formulated in the form of mixed integer nonlinear programming (MINLP) to maximize the sum UEs’ achievable rate subject to limited fronthaul capacity constraints. Solving the considered problem is hard owing to its non-convexity and the unavailability of channel state information (CSI) due to the movement of UAVs. To tackle these effects, we propose a novel algorithm comprising of two distinguishing features: (i) exploiting a deep Q-learning approach to tackle the issue of CSI unavailability for determining UAVs’ positions, (ii) developing a difference of convex algorithm (DCA) to efficiently solve for the UAV’s transmit beamforming and UAV-UE association. The proposed algorithm recursively solves the problem of interest until convergence, where each recursion executes two steps. In the first step, the deep Q-learning (DQL) algorithm allows UAVs to learn the overall network state and account for the joint movement of all UAVs to adapt their locations. In the second step, given the determined UAVs’ positions from the DQL algorithm, the DCA iteratively solves a convex approximate subproblem of the original non-convex MINLP problem with the updated parameters, where the problem’s variables are transmit beamforming and UAV-UE association. Numerical results show that our design outperforms the existing algorithms in terms of algorithmic convergence and network performance with a gain of up to 70%.

Keywords:

Computer science Unavailability Beamforming Mathematical optimization Channel state information Wireless network Wireless Telecommunications link Convergence (economics) Reinforcement learning Computer network Artificial intelligence Mathematics Telecommunications

Metrics

Cited By

22.74

FWCI (Field Weighted Citation Impact)

Refs

1.00

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

UAV Applications and Optimization

Physical Sciences → Engineering → Aerospace Engineering

Advanced Wireless Communication Technologies

Physical Sciences → Engineering → Electrical and Electronic Engineering

Advanced MIMO Systems Optimization

Physical Sciences → Engineering → Electrical and Electronic Engineering

Deep Reinforcement Learning-Based Resource Allocation in Cooperative UAV-Assisted Wireless Networks

Abstract

Metrics

Citation History

Topics

Related Documents

Resource Allocation in UAV-Assisted Wireless Networks Using Reinforcement Learning

Deep Reinforcement Learning-Based Resource Allocation for Multi-UAV-Assisted Full-Duplex Wireless-Powered IoT Networks

Deep Learning Based Cooperative Resource Allocation in 5G Wireless Networks

Resource allocation for UAV-assisted 5G mMTC slicing networks using deep reinforcement learning

Resource Allocation for Multi-UAV Assisted IoT Networks: A Deep Reinforcement Learning Approach