Inter-Cell Network Slicing With Transfer Learning Empowered Multi-Agent Deep Reinforcement Learning

Tianlun Hu; Qi Liao; Qiang Liu; Georg Carle

doi:10.1109/ojcoms.2023.3273310

ScienceGate Book Chapters

JOURNAL ARTICLE

Inter-Cell Network Slicing With Transfer Learning Empowered Multi-Agent Deep Reinforcement Learning

Tianlun Hu Qi Liao Qiang Liu Georg Carle

Year: 2023 Journal: IEEE Open Journal of the Communications Society Vol: 4 Pages: 1141-1155 Publisher: IEEE Communications Society

DOI: 10.1109/ojcoms.2023.3273310

Get Full-Text PDF Get Analytical Report

Abstract

Network slicing enables operators to efficiently support diverse applications on a common physical infrastructure. The ever-increasing densification of network deployment leads to complex and non-trivial inter-cell interference, which requires more than inaccurate analytic models to dynamically optimize resource management for network slices. In this paper, we develop a DIRP algorithm with multiple deep reinforcement learning (DRL) agents to cooperatively optimize resource partition in individual cells to fulfill the requirements of each slice, based on two alternative reward functions. Nevertheless, existing DRL approaches usually tie the pretrained model parameters to specific network environments with poor transferability, which raises practical deployment concerns in large-scale mobile networks. Hence, we design a novel transfer learning-aided DIRP (TL-DIRP) algorithm to ease the transfer of DIRP agents across different network environments in terms of sample efficiency, model reproducibility, and algorithm scalability. The TL-DIRP algorithm first centrally trains a generalized model and then transfers the "generalist" to each local agent as "specialist" with distributed finetuning and execution. TL-DIRP consists of two steps: 1) centralized training of a generalized distributed model, 2) transferring the "generalist" to each "specialist" with distributed finetuning and execution. The numerical results show that not only DIRP outperforms existing baseline approaches in terms of faster convergence and higher reward, but more importantly, TL-DIRP significantly improves the service performance, with reduced exploration cost, accelerated convergence rate, and enhanced model reproducibility. As compared to a traffic-aware baseline, TL-DIRP provides about 15% less violation ratio of the quality of service (QoS) for the worst slice service and 8.8% less violation on the average service QoS.

Keywords:

Computer science Scalability Reinforcement learning Distributed computing Baseline (sea) Quality of service Software deployment Slicing Transfer of learning Partition (number theory) Convergence (economics) Artificial intelligence Computer network

Metrics

Cited By

1.76

FWCI (Field Weighted Citation Impact)

Refs

0.73

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Software-Defined Networks and 5G

Physical Sciences → Computer Science → Computer Networks and Communications

Machine Learning and ELM

Physical Sciences → Computer Science → Artificial Intelligence

Full-Duplex Wireless Communications

Physical Sciences → Engineering → Electrical and Electronic Engineering

Inter-Cell Network Slicing With Transfer Learning Empowered Multi-Agent Deep Reinforcement Learning

Abstract

Metrics

Citation History

Topics

Related Documents

Inter-Cell Slicing Resource Partitioning via Coordinated Multi-Agent Deep Reinforcement Learning

Adaptive Network Slicing Using Multi-Agent Deep Reinforcement Learning

Network Slicing via Transfer Learning aided Distributed Deep Reinforcement Learning

Network slicing for vehicular communications: a multi-agent deep reinforcement learning approach

A Multi-Agent Reinforcement Learning Architecture for Network Slicing Orchestration