Shiyang ZhouYufan ChengXia Lei
This paper considers an unmanned aerial vehicle (UAV)-assisted downlink transmission. To maximize the aver-age achievable channel capacity among the ground users, a multi-agent low-bias deep reinforcement learning (MA-LB-DRL) scheme is proposed to solve the joint optimization problem of trajectory design, channel selection and power control. Firstly, considering the NP-hard, non-convex, and mixed-integer features, the joint optimization problem is decoupled into two sub-problems, which deal with the integer-action and continuous-action problem respectively. Secondly, an MA-LB-DRL scheme, which consists of a multi-agent double deep Q-network (MAD-DQN) for channel selection and a multi-agent twin delayed deep deterministic policy gradient (MATD3PG) for trajectory design and power control, is proposed to overcome the overestimation bias via reducing value function approximation error at the expense of tiny complexity. Finally, the simulation results demon-strate that the proposed scheme achieves high channel capacity than the benchmark schemes.
Jingjing CuiYuanwei LiuArumugam Nallanathan
Nawaf Qasem Hamood OthmanJinglei LiQinghai Yang
Phuong LuongFrançois GagnonFabrice Labeau
Khoi Khac NguyenSaeed R. KhosraviradDaniel Benevides da CostaLong D. NguyenTrung Q. Duong