JOURNAL ARTICLE

Resource Allocation Using Deep Reinforcement Learning in GEO Multibeam Satellite System

Abstract

Future Internet of Things (IoT) is becoming more and more common in multi-beam satellite communications. However, due to the constrained system resource, the next generation of Geostationary Earth Orbit (GEO) multibeam satellite systems will focus on achieving flexible joint power and bandwidth allocation. Therefore, a GEO multi-beam satellite resource optimization scheme is proposed in this paper. We first model the optimization problem as a Markov decision process (MDP) due to the continuity and variation of the system resource. Then we propose a deep reinforcement learning (DRL) algorithm based on Asynchronous Advantage Actor-critic (A3C) to jointly allocate power and bandwidth whose goal is to satisfy the beam traffic demand. The algorithm aims to increase the system throughput and user fairness. The simulation results demonstrate that our proposed algorithm achieves significant advantages over the existing algorithms.

Keywords:
Computer science Reinforcement learning Geostationary orbit Markov decision process Resource allocation Communications satellite Resource management (computing) Bandwidth (computing) Throughput Distributed computing Bandwidth allocation Satellite Real-time computing Markov process Mathematical optimization Computer network Wireless Artificial intelligence Telecommunications Engineering

Metrics

5
Cited By
2.60
FWCI (Field Weighted Citation Impact)
16
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Satellite Communication Systems
Physical Sciences →  Engineering →  Aerospace Engineering
Age of Information Optimization
Physical Sciences →  Computer Science →  Computer Networks and Communications
IoT Networks and Protocols
Physical Sciences →  Engineering →  Electrical and Electronic Engineering

Related Documents

JOURNAL ARTICLE

Dynamic Resource Allocation With Deep Reinforcement Learning in Multibeam Satellite Communication

Danhao DengChaowei WangMingliang PangWeidong Wang

Journal:   IEEE Wireless Communications Letters Year: 2022 Vol: 12 (1)Pages: 75-79
BOOK-CHAPTER

Deep Reinforcement Learning-Based Channel and Power Allocation in Multibeam LEO Satellite Systems

Junrong LiFuzhou PengXijun WangXiang Chen

Lecture notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering Year: 2024 Pages: 103-116
JOURNAL ARTICLE

An online power allocation algorithm based on deep reinforcement learning in multibeam satellite systems

Pei ZhangXiaohui WangZhiguo MaShuaijun LiuJunde Song

Journal:   International Journal of Satellite Communications and Networking Year: 2020 Vol: 38 (5)Pages: 450-461
© 2026 ScienceGate Book Chapters — All rights reserved.