JOURNAL ARTICLE

Deep Reinforcement Learning Assisted UAV Trajectory and Resource Optimization for NOMA Networks

Abstract

Unmanned aerial vehicles (UAVs) are widely used as aerial base stations (BSs) to provide wireless communication services. In this paper, we consider the UAV's trajectory and power allocation design for downlink communication rate maximization in a UAV-enabled network in disaster areas or the cell fringe. Non-orthogonal multiple access (NOMA) is used to improve the spectrum efficiency of the entire network, while all users are roaming around randomly. The formulated problem is non-convex and the considered environment is dynamic. Such a problem is difficult to be solved via conventional optimization methods. Therefore, we propose a soft actor-critic (SAC) learning scheme to tackle the pertinent problem. Simulation results show that our proposed learning framework is more stable and has a faster convergence rate compared to baseline approaches.

Keywords:
Computer science Base station Roaming Reinforcement learning Telecommunications link Maximization Utility maximization Trajectory Wireless network Noma Trajectory optimization Optimization problem Resource allocation Wireless Cellular network Resource management (computing) Scheme (mathematics) Power control Transmitter power output Convergence (economics) Mathematical optimization Computer network Power (physics) Artificial intelligence Optimal control Telecommunications Algorithm

Metrics

3
Cited By
1.01
FWCI (Field Weighted Citation Impact)
15
Refs
0.83
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

UAV Applications and Optimization
Physical Sciences →  Engineering →  Aerospace Engineering
Advanced Wireless Communication Technologies
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Distributed Control Multi-Agent Systems
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.