Mean-Field Multi-Agent Reinforcement Learning for Adaptive Anti-Jamming Channel Selection in UAV Communications

Feng Du; Jun Li; Yan Lin; Zhe Wang; Yuwen Qian

doi:10.1109/wcsp55476.2022.10039304

ScienceGate Book Chapters

JOURNAL ARTICLE

Mean-Field Multi-Agent Reinforcement Learning for Adaptive Anti-Jamming Channel Selection in UAV Communications

Feng Du Jun Li Yan Lin Zhe Wang Yuwen Qian

Year: 2022 Pages: 910-915

DOI: 10.1109/wcsp55476.2022.10039304

Get Full-Text PDF Get Analytical Report

Abstract

In the large-scale anti-jamming UAV communication network, massive number of UAV users aim to compete for limited spectrum resources while fighting against possible external interference from malicious jammers. Specifically, each UAV-to-UAV (U2U) communication link targets at finding the optimal channel selection that maximizes its long-term expected achievable rate. We formulate the distributed multi-UAV anti-jamming problem as a partially observable stochastic game (POSG), where each UAV only has partial observability of the entire network environment due to the limited sensing capabilities. To deal with the complex interactions among large-scale UAVs, we simplify the POSG problem as a mean-field game, where each U2U link only interacts with the aggregate interference from the neighboring U2U links and the malicious jammers. We propose a soft mean-field Q learning (Soft-MFQ) algorithm to obtain the Nash equilibrium of the U2Us' channel selection policies in a model-free scenario. The simulation results show that the proposed algorithm outperforms other benchmark algorithms in terms of convergence speed and the average reward, especially when the number of UAVs is large.

Keywords:

Jamming Computer science Observability Reinforcement learning Channel (broadcasting) Benchmark (surveying) Convergence (economics) Interference (communication) Selection (genetic algorithm) Distributed computing Computer network Artificial intelligence Mathematics

Metrics

Cited By

1.71

FWCI (Field Weighted Citation Impact)

Refs

0.80

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Distributed Control Multi-Agent Systems

Physical Sciences → Computer Science → Computer Networks and Communications

UAV Applications and Optimization

Physical Sciences → Engineering → Aerospace Engineering

Security in Wireless Sensor Networks

Physical Sciences → Computer Science → Computer Networks and Communications

Mean-Field Multi-Agent Reinforcement Learning for Adaptive Anti-Jamming Channel Selection in UAV Communications

Abstract

Metrics

Citation History

Topics

Related Documents

Meta-Reinforcement Learning in Time-Varying UAV Communications: Adaptive Anti-Jamming Channel Selection

Adaptive mean field multi-agent reinforcement learning

Multi-Agent Reinforcement Learning Based Cognitive Anti-Jamming

Joint Power and Channel Selection for Anti-jamming Communications: A Reinforcement Learning Approach

Reinforcement Learning Based Anti-Jamming Cognitive Radio Channel Selection