Deep Reinforcement Learning for Trajectory Design and Power Allocation in UAV Networks

Nan Zhao; Yiqiang Cheng; Yiyang Pei; Ying‐Chang Liang; Dusit Niyato

doi:10.1109/icc40277.2020.9149196

ScienceGate Book Chapters

JOURNAL ARTICLE

Deep Reinforcement Learning for Trajectory Design and Power Allocation in UAV Networks

Nan Zhao Yiqiang Cheng Yiyang Pei Ying‐Chang Liang Dusit Niyato

Year: 2020 Pages: 1-6

DOI: 10.1109/icc40277.2020.9149196

Get Full-Text PDF Get Analytical Report

Abstract

Unmanned aerial vehicle (UAV) is considered to be a key component in the next-generation cellular networks. Considering the non-convex characteristic of the trajectory design and power allocation problem, it is difficult to obtain the optimal joint strategy in UAV-assisted cellular networks. In this paper, a reinforcement learning-based approach is proposed to obtain the maximum long-term network utility while meeting with user equipments' quality of service requirement. The Markov decision process (MDP) is formulated with the design of state, action space, and reward function. In order to achieve the joint optimal policy of trajectory design and power allocation, deep reinforcement learning approach is investigated. Due to the continuous action space of the MDP model, deep deterministic policy gradient approach is presented. Simulation results show that the proposed algorithm outperforms other approaches on overall network utility performance with higher system capacity and faster processing speed.

Keywords:

Reinforcement learning Markov decision process Computer science Trajectory Q-learning Mathematical optimization State space Markov process Component (thermodynamics) Function (biology) Artificial intelligence Mathematics

Metrics

Cited By

5.38

FWCI (Field Weighted Citation Impact)

Refs

0.96

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

UAV Applications and Optimization

Physical Sciences → Engineering → Aerospace Engineering

Distributed Control Multi-Agent Systems

Physical Sciences → Computer Science → Computer Networks and Communications

Smart Parking Systems Research

Physical Sciences → Engineering → Building and Construction

Deep Reinforcement Learning for Trajectory Design and Power Allocation in UAV Networks

Abstract

Metrics

Citation History

Topics

Related Documents

Multi-Agent Deep Reinforcement Learning for Trajectory Design and Power Allocation in Multi-UAV Networks

Trajectory Design and Resource Allocation for Multi-UAV Networks: Deep Reinforcement Learning Approaches

Power Allocation in Multi-Cell Networks Using Deep Reinforcement Learning

Deep Reinforcement Learning Based 3D UAV Trajectory Design and Frequency Band Allocation

Deep Reinforcement Learning-Based Adaptive Power Allocation for Power Line Communication Networks