In this article, we propose a (τ, ε)-greedy reinforcement learning algorithm for anti-jamming wireless communications, which repeats the previous action with probability τ and applies ε-greedy with probability 1-τ. The key idea is that the more valuable the previous action, the higher the probability of directly performing it again in the current time slot without learning. To this end, the average utility over several previous actions is first computed as a threshold for judging whether an action is valuable. Then, τ is formulated as a Gaussian-like function of the difference between this threshold and the utility of the previous action, which enables the wireless devices to find the optimal action faster in the early stage while still ensuring convergence. As a concrete example, the proposed algorithm is implemented in a wireless communication system against multiple jammers. Simulation results show that, compared with ε-greedy, (τ, ε)-greedy achieves a faster convergence rate and a slightly higher signal-to-interference-plus-noise ratio when applied to Q-learning, deep Q-networks (DQN), double DQN (DDQN), and prioritized experience replay based DDQN (PDDQN). The source code is available at https://github.com/GZHUDVL/tau-epsilon-greedy-RL.
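The selection rule described above can be sketched in a few lines. This is an illustrative reading of the abstract, not the authors' implementation: the function name, the `sigma` width parameter, and the exact Gaussian-like form of τ (here, a Gaussian in the clipped gap between the utility threshold and the previous action's utility, so that τ approaches 1 when the previous action beats the average) are assumptions; the released code at the repository above defines the actual formulation.

```python
import math
import random

def tau_epsilon_greedy(q_values, prev_action, prev_utility,
                       utility_history, epsilon=0.1, sigma=1.0):
    """(tau, epsilon)-greedy action selection (illustrative sketch).

    With probability tau, repeat the previous action without learning;
    otherwise fall back to standard epsilon-greedy over q_values.
    """
    # Threshold: average utility of several previous actions.
    threshold = sum(utility_history) / len(utility_history)
    # Gaussian-like tau in the (threshold - prev_utility) gap (assumed form):
    # tau -> 1 when the previous action beats the average, decays otherwise.
    gap = max(threshold - prev_utility, 0.0)
    tau = math.exp(-gap ** 2 / (2 * sigma ** 2))
    if random.random() < tau:
        return prev_action                      # repeat the previous action
    if random.random() < epsilon:
        return random.randrange(len(q_values))  # explore uniformly
    return max(range(len(q_values)), key=q_values.__getitem__)  # greedy
```

Note that as learning stabilizes and the previous action's utility hovers near the running average, τ stays high and the device mostly repeats its action, which is what yields the faster early-stage convergence claimed above.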
Jie Qi, Hongming Zhang, Xiaolei Qi, Mugen Peng
Chen Wang, Yifan Chen, Zhiping Lin, Qiaoxin Chen, Liang Xiao
Zhiping Lin, Liang Xiao, Hongyi Chen, Zefang Lv