JOURNAL ARTICLE

Multi-Agent Reinforcement Learning for a Random Access Game

Dongwoo LeeYu ZhaoJun-Bae SeoJoohyun Lee

Year: 2022 Journal:   IEEE Transactions on Vehicular Technology Vol: 71 (8)Pages: 9119-9124   Publisher: Institute of Electrical and Electronics Engineers

Abstract

This work investigates a random access (RA) game for a time-slotted RA system, where $N$ players choose a set of slots of a frame and each frame consists of $M$ multiple time slots. We obtain the pure strategy Nash equilibria (PNEs) of this RA game, where slots are fully utilized as in the centralized scheduling. As an algorithm to realize a PNE (Pure strategy Nash Equilibrium), we propose an Exponential-weight algorithm for Exploration and Exploitation (EXP3)-based multi-agent (MA) learning algorithm, which has the computational complexity of $O(N N_{\max }^{2} T)$ . EXP3 is a bandit algorithm designed to find an optimal strategy in a multi-armed bandit (MAB) problem that users do not know the expected payoff of each strategy. Our simulation results show that the proposed algorithm can achieve PNEs. Moreover, it can adapt to time-varying environments, where the number of players varies over time.

Keywords:
Notation Nash equilibrium Reinforcement learning Stochastic game Frame (networking) Discrete mathematics Mathematics Computer science Combinatorics Algorithm Mathematical optimization Artificial intelligence Mathematical economics Arithmetic

Metrics

7
Cited By
1.40
FWCI (Field Weighted Citation Impact)
26
Refs
0.80
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Bandit Algorithms Research
Social Sciences →  Decision Sciences →  Management Science and Operations Research
Smart Grid Energy Management
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Auction Theory and Applications
Social Sciences →  Decision Sciences →  Management Science and Operations Research
© 2026 ScienceGate Book Chapters — All rights reserved.