JOURNAL ARTICLE

Coordination in multiagent reinforcement learning

Abstract

Much emphasis in multiagent reinforcement learning (MARL) research is placed on ensuring that MARL algorithms (eventually) converge to desirable equilibria. As in standard reinforcement learning, convergence generally requires sufficient exploration of strategy space. However, exploration often comes at a price in the form of penalties or foregone opportunities. In multiagent settings, the problem is exacerbated by the need for agents to "coordinate" their policies on equilibria. We propose a Bayesian model for optimal exploration in MARL problems that allows these exploration costs to be weighed against their expected benefits using the notion of value of information. Unlike standard RL models, this model requires reasoning about how one's actions will influence the behavior of other agents. We develop tractable approximations to optimal Bayesian exploration, and report on experiments illustrating the benefits of this approach in identical interest games.

Keywords:
Reinforcement learning Computer science Convergence (economics) Bayesian probability Q-learning Artificial intelligence Multi-agent system Mathematical optimization Machine learning Mathematics Economics

Metrics

154
Cited By
9.54
FWCI (Field Weighted Citation Impact)
24
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Auction Theory and Applications
Social Sciences →  Decision Sciences →  Management Science and Operations Research
Game Theory and Applications
Social Sciences →  Decision Sciences →  Management Science and Operations Research
Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Coordination in multiagent reinforcement learning

George ChalkiadakisCraig Boutilier

Journal:   Proceedings of the second international joint conference on Autonomous agents and multiagent systems - AAMAS '03 Year: 2003
BOOK-CHAPTER

Coordination in Multiagent Reinforcement Learning Systems

Md Abdus Samad KamalJunichi Murata

Lecture notes in computer science Year: 2004 Pages: 1197-1204
© 2026 ScienceGate Book Chapters — All rights reserved.