JOURNAL ARTICLE

Deep Reinforcement Learning Based Mobility Load Balancing Under Multiple Behavior Policies

Abstract

The mobility load balancing (MLB) in self-organizing networks (SONs) is designed to automatically resolve the mismatch between network resource distribution and network traffic demand. In this paper, we propose an off-policy deep reinforcement learning (DRL) based MLB framework to balance the load distribution among all the cells. Our main contribution is three-fold. First, we propose to use off-policy RL with multiple behavior policies to autonomously learn the optimal MLB policy without any prior knowledge over the underlying wireless environments. Second, we propose a corresponding DRL-based MLB model by using deep neural networks as the function approximators to improve the generalization ability over complex system states. Third, we propose an asynchronous parallel learning framework for MLB to improve the training efficiency in a collaborative manner. Experimental results show that our proposed DRL-based MLB model can outperform the existing approaches considerably.

Keywords:
Reinforcement learning Computer science Asynchronous communication Generalization Artificial intelligence Artificial neural network Distributed computing Function (biology) Machine learning Computer network

Metrics

15
Cited By
0.91
FWCI (Field Weighted Citation Impact)
23
Refs
0.76
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced MIMO Systems Optimization
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Wireless Networks and Protocols
Physical Sciences →  Computer Science →  Computer Networks and Communications
Energy Harvesting in Wireless Networks
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.