JOURNAL ARTICLE

Potential-based difference rewards for multiagent reinforcement learning

Sam DevlinLogan YliniemiDaniel Kudenko⋆Kagan Tumer

Year: 2014 Journal:   Adaptive Agents and Multi-Agents Systems Pages: 165-172

Abstract

Difference rewards and potential-based reward shaping can both significantly improve the joint policy learnt by multiple reinforcement learning agents acting simultaneously in the same environment. Difference rewards capture an agent's contribution to the system's performance. Potential-based reward shaping has been proven to not alter the Nash equilibria of the system but requires domain-specific knowledge. This paper introduces two novel reward functions that combine these methods to leverage the benefits of both.Using the difference reward's Counterfactual as Potential (CaP) allows the application of potential-based reward shaping to a wide range of multiagent systems without the need for domain specific knowledge whilst still maintaining the theoretical guarantee of consistent Nash equilibria.Alternatively, Difference Rewards incorporating Potential-Based Reward Shaping (DRiP) uses potential-based reward shaping to further shape difference rewards. By exploiting prior knowledge of a problem domain, this paper demonstrates agents using this approach can converge either up to 23.8 times faster than or to joint policies up to 196% better than agents using difference rewards alone.

Keywords:
Reinforcement learning Leverage (statistics) Temporal difference learning Counterfactual thinking Computer science Artificial intelligence Nash equilibrium Range (aeronautics) Reward system Domain (mathematical analysis) Machine learning Mathematical optimization Mathematics Engineering Psychology Social psychology

Metrics

79
Cited By
3.38
FWCI (Field Weighted Citation Impact)
23
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Traffic control and management
Physical Sciences →  Engineering →  Control and Systems Engineering
Advanced Software Engineering Methodologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Learning Individual Potential-Based Rewards in Multiagent Reinforcement Learning

Chen YangPei XuJunge Zhang

Journal:   IEEE Transactions on Games Year: 2024 Vol: 17 (2)Pages: 334-345
JOURNAL ARTICLE

Modeling difference rewards for multiagent learning

Scott ProperKagan Tumer

Journal:   Adaptive Agents and Multi-Agents Systems Year: 2012 Pages: 1397-1398
BOOK-CHAPTER

Multiagent Reinforcement Learning Algorithm Using Temporal Difference Error

SeungGwan Lee

Lecture notes in computer science Year: 2005 Pages: 627-633
JOURNAL ARTICLE

Approximate Difference Rewards for Scalable Multigent Reinforcement Learning.

Arambam James SinghAkshat KumarHoong Chuin Lau

Journal:   Autonomous Agents and Multi-Agent Systems Year: 2021 Pages: 1655-1657
© 2026 ScienceGate Book Chapters — All rights reserved.