As a new paradigm, edge caching is deemed an effective alternative that fetches contents at the network edge rather than from remote servers. However, designing an efficient caching mechanism is challenging. First, the content library is a dynamic set rather than a static one. Second, the same content may enjoy different popularity across small base stations (SBSs), resulting in different rewards. These factors require each SBS to learn its own caching decisions in a multi-SBS network. Existing reinforcement learning algorithms either fail to account for the non-stationary environment or provide no performance guarantee, so they no longer work well in this setting. This work proposes MAMAB-C, a multi-agent multi-armed bandit caching framework that guides SBSs to cache contents in a distributed manner. Specifically, we formulate the multi-SBS caching optimization problem as an online integer linear program (ILP) and convert it into a multi-agent multi-armed bandit (MAMAB) problem with resource constraints. MAMAB-C achieves a sub-linear regret bound and significantly outperforms multiple state-of-the-art algorithms.
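To make the bandit view of caching concrete, the following is a minimal illustrative sketch (not the paper's MAMAB-C algorithm): a single SBS treats each content as an arm, uses a standard UCB1 score to trade off exploration and exploitation, and caches the top-`cache_size` contents each round subject to its capacity constraint. The popularity profile, reward model (a cache hit as a Bernoulli reward), and all function names here are assumptions for illustration only.

```python
import math
import random

def ucb_cache_selection(counts, rewards, t, cache_size):
    """Select the top-`cache_size` contents by UCB1 score.

    counts[i]  -- times content i has been cached so far
    rewards[i] -- cumulative reward (cache hits) earned by content i
    t          -- current round (1-indexed)
    """
    scores = []
    for i in range(len(counts)):
        if counts[i] == 0:
            # Explore each content at least once before exploiting.
            scores.append((float("inf"), i))
        else:
            mean = rewards[i] / counts[i]
            bonus = math.sqrt(2 * math.log(t) / counts[i])
            scores.append((mean + bonus, i))
    scores.sort(reverse=True)
    return [i for _, i in scores[:cache_size]]

def simulate(popularity, cache_size, T, seed=0):
    """Run one SBS for T rounds against a fixed, unknown popularity profile."""
    rng = random.Random(seed)
    n = len(popularity)
    counts = [0] * n
    rewards = [0.0] * n
    total_hits = 0.0
    for t in range(1, T + 1):
        cached = ucb_cache_selection(counts, rewards, t, cache_size)
        for i in cached:
            # A request hits cached content i with its popularity probability.
            hit = 1.0 if rng.random() < popularity[i] else 0.0
            counts[i] += 1
            rewards[i] += hit
            total_hits += hit
    return total_hits, counts
```

Over enough rounds, the cache converges to the most popular contents, which is the single-agent intuition behind the multi-agent formulation; the full MAMAB setting additionally coordinates reward estimates across SBSs under shared resource constraints.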