JOURNAL ARTICLE

Multi-agent Multi-armed Bandit Learning for Content Caching in Edge Networks

Abstract

As a new paradigm, edge caching is deemed an effective alternative by fetching contents at the network edge. However, designing an efficient caching mechanism is challenging. First, the content library is a dynamic set rather than a static set. Second, the content may be prevalent in different small base stations (SBSs), resulting in different rewards. Thus, the above reasons require each SBS could learn its caching decisions in a multi-SBSs network. Existing reinforcement learning algorithms either fail to consider the non-stationary environment or do not provide any performance guarantee. Thus, previous algorithms work well no longer. This work proposes a multi-agent multi-armed bandit caching framework, MAMAB-C, which navigates SBSs to cache contents in a distributed manner. Specifically, we formulate the multi-SBSs caching optimization problem as an online integer linear program (ILP) and convert it into a multi-agent multi-armed bandit (MAMAB) problem with resource constraints. MAMAB-C can realize the sub-linear metric property and significantly outperform multiple state-of-the-art algorithms.

Keywords:
Computer science Cache Reinforcement learning Base station Enhanced Data Rates for GSM Evolution Set (abstract data type) Metric (unit) Linear programming Integer programming Performance metric Distributed computing Optimization problem Computer network Artificial intelligence Algorithm

Metrics

6
Cited By
1.29
FWCI (Field Weighted Citation Impact)
19
Refs
0.74
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Caching and Content Delivery
Physical Sciences →  Computer Science →  Computer Networks and Communications
Recommender Systems and Techniques
Physical Sciences →  Computer Science →  Information Systems
Advanced Bandit Algorithms Research
Social Sciences →  Decision Sciences →  Management Science and Operations Research

Related Documents

JOURNAL ARTICLE

Collaborative Multi-Agent Multi-Armed Bandit Learning for Small-Cell Caching

Xianzhe XuMeixia TaoCong Shen

Journal:   IEEE Transactions on Wireless Communications Year: 2020 Vol: 19 (4)Pages: 2570-2585
JOURNAL ARTICLE

Decentralized Multi-Agent Multi-Armed Bandit Learning With Calibration for Multi-Cell Caching

Xianzhe XuMeixia Tao

Journal:   IEEE Transactions on Communications Year: 2020 Vol: 69 (4)Pages: 2457-2472
JOURNAL ARTICLE

Federated Multi-Armed Bandit Learning for Caching in UAV-aided Content Dissemination

Amit Kumar BhuyanHrishikesh DuttaSubir Biswas

Journal:   Ad Hoc Networks Year: 2023 Vol: 151 Pages: 103306-103306
JOURNAL ARTICLE

Multi-Agent Multi-Armed Bandit Learning for Online Management of Edge-Assisted Computing

Bochun WuTianyi ChenWei NiXin Wang

Journal:   IEEE Transactions on Communications Year: 2021 Vol: 69 (12)Pages: 8188-8199
© 2026 ScienceGate Book Chapters — All rights reserved.