DEEP REINFORCEMENT LEARNING-BASED RESOURCE ALLOCATION IN MASSIVE MIMONOMA SYSTEMS

Pham Hoai An; Nguyen Dung; Nguyen Thi Xuan Uyen; Nguyen Thai Cong Nghia; Ngo Minh Nghia

doi:10.5121/ijcnc.2025.17601

ScienceGate Book Chapters

JOURNAL ARTICLE

DEEP REINFORCEMENT LEARNING-BASED RESOURCE ALLOCATION IN MASSIVE MIMONOMA SYSTEMS

Pham Hoai An Nguyen Dung Nguyen Thi Xuan Uyen Nguyen Thai Cong Nghia Ngo Minh Nghia

Year: 2025 Journal: International journal of Computer Networks & Communications Vol: 17 (6)Pages: 01-20

DOI: 10.5121/ijcnc.2025.17601

Get Full-Text PDF Get Analytical Report

Abstract

Massive MIMO systems with preconfigured spatial beams efficiently serve near-field (NF) users, while farfield (FF) users can be multiplexed on the same beams using non-orthogonal multiple access (NOMA). To realistically capture propagation, the spherical wave model (SWM) is employed for NF channels and the plane wave model (PWM) for FF channels, reflecting the distinct near- and far-field regions. While conventional optimization approaches such as successive convex approximation (SCA) and branch-andbound (BB) suffer from local optimality or prohibitive complexity, recent advances in deep learning have enabled scalable and adaptive solutions for wireless resource allocation. On this basis, a resource allocation strategy is developed using the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm, where the base station acts as an agent that dynamically adjusts power and allocation coefficients to maximize the sum throughput of FF users. Simulation results show that the proposed DRLbased method can approach, and in some cases match, deterministic SCA at high SNR, while consistently outperforming randomly initialized SCA in medium-to-high SNR regimes. Compared to optimization-based baselines, the TD3 approach eliminates iterative problem reformulation, reduces computational complexity, and provides stronger adaptability to dynamic channels and user mobility.

Keywords:

Resource allocation Reinforcement learning Throughput Base station Scalability Adaptability Resource management (computing) MIMO Iterative method

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.61

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Advanced Wireless Communication Technologies

Physical Sciences → Engineering → Electrical and Electronic Engineering

Advanced MIMO Systems Optimization

Physical Sciences → Engineering → Electrical and Electronic Engineering

Millimeter-Wave Propagation and Modeling

Physical Sciences → Engineering → Electrical and Electronic Engineering

DEEP REINFORCEMENT LEARNING-BASED RESOURCE ALLOCATION IN MASSIVE MIMONOMA SYSTEMS

Abstract

Metrics

Topics

Related Documents

Deep Reinforcement Learning for Resource Allocation in Massive MIMO

Deep Reinforcement Learning Based Resource Allocation for Network Slicing With Massive MIMO

Deep Reinforcement Learning Based Uplink Resource Allocation in Open RAN Systems

Deep Reinforcement Learning based Resource Allocation in NOMA

Aircraft Resource Allocation Based on Deep Reinforcement Learning